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PHAGOCYTOSIS GENES AND USES THEREOF 



PFT ATFn APPTJCATIONS 

This application is a continuation-in-part of and claims priority to U.K. 
Patent Application No. 9820816.8 filed September 24, 1998 and U.K. Patent 
5 Application No. 9812660.0 filed June 1 1, 1998; and is a continuation-in-part of 
and claims priority to U.S. Application No. 09/096,347, filed June 1 1, 1998 and 
U.S. Application No. 09/096,631, filed June 11, 1998; and claims the benefit of 
U.S. Provisional Application No. 60/072,324, filed January 23, 1998. The 
teachings of all of the referenced applications are incorporated herein by reference 
10 in their entirety. 

noVF.RNMKNT SUPPORT 

The invention was supported, in whole or in part, by Grant GM52540 fi"om 
the National Institutes of Health. The Government has certain rights in the 
invention. 

15 BACKGROUND TO THE I NVENTION 

Phagocytosis or engulfinent, is a specialized form of endocytosis through 
which eukaryotes take up very large particles, or even whole cells. It is a 
fimdamental biological process conserved from single-cell organisms, such as 
amoebae to mammals (Metchnikoff, E. 1891), Lectures on the comparative 

20 pathology of inflammation; delivered at the Pasteur Institute, 1891, 1968 Edition 
(New York: Dover Publication)). Initially used for the dual purpose of feeding 
and defence, phagocytosis evolved, following the emergence of mesoderm, into a 
mechanism used to protect the host against invading organisms and to clear up 
foreign particles and cell debris (Metchnikoff, 1891). Recently, the significance of 

25 phagocytosis has been extended due to its role in eliminating cells undergoing 
programmed cell death (apoptosis). Since apoptosis has been implicated in a 
number of human diseases elucidation of the regulation of this phagocytosis is 
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highly desirable since it may lead to a new route of therapeutic intervention in 
these diseases. Accordingly, a need exists to isolate a gene and protein that 
regulate phagocytosis. A further need exists for therapeutic treatment for diseases 
related to phagocytosis of apoptotic cells. 

5 SUMMARY OF THE INVENTION 

Genetic studies in C elegans have identified over a dozen genes that 
function in programmed cell death. The present inventors have used the positional 
method to clone and have functionally characterized the C. elegans gene CED-6. 
It is shown that the CED-6 protein contains a phosphotyrosine binding domain and 

10 several potential SH3 binding sites. It is further demonstrated that CED-6 acts 
within engxilfing cells, and functions to promote the removal of both early and 
persistent cell corpses. Overexpression of CED-6 can partially suppress the 
engulfinent defect of both CED-1 and CED-7, suggesting that CED-6 functions 
downstream of these two genes. CED-6 acts as an adaptor molecule in a signal 

15 transduction pathway that mediates the engulfment of apoptotic cells in C elegans. 
The present inventors have also identified isolated and characterized human CED- 
6 homologue including a splice variant thereof, which it is shown is involved in a 
similar process in mammalian cells. 

The invention provides, in isolated form, a protein which is the CED-6 

20 protein of C elegans or a protein which h£is equivalent function thereto and human 
homologues of the protein, hereinafter referred to as hi CED-6, h2CED-6, and 
h3CED-6. 

The invention further provides a functional fi-agment of CED-6, hi CED-6, 
h2CED-6 and h3 CED-6, for example, a fragment corresponding to the 
25 phosphotyrosine binding domain and/or the proline/serine rich region. 

The invention further provides an isolated nucleic acid encoding CED-6 
and human homologues of CED-6, as well as nucleic acid encoding functional 
fragments of CED-6, hi CED-6, h2-CED-6 and h3-CED-6 as described above. 
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The invention further provides nucleic acid which is antisense to any of the 
nucleic acids described above or which is capable of hybridizing to any of the 
nucleic acids described above under conditions of low, medium or high stringency 
or portions or fragments thereof. 
5 The invention further provides expression vectors comprising nucleic acid 

encoding CED-6, hlCED-6, h2CED-6, h3CED-6 or encoding functional fragments 

of said proteins as above. 

The invention fiirther provides mammalian cell-lines transfected with one 
or more nucleic acids encoding CED-6, hlCED-6, h2CED-6, and/or h3CED-6. 
1 0 The invention further provides assay methods using the proteins, nucleic 

acids and transfected cells described above to identify compounds which enhance 
or inhibit the signal transduction pathway in which CED-6, hi CED-6, h2CED-6, 
and/or h3CED-6 participate. 

The mvention further provides assay methods using the transfected cells 
15 described above to identify compounds which enhance or inhibit the expression of 
the CED-6, hlCED-6, h2CED-6 or h3CED-6 genes. 

The invention fiirther provides antibodies which react with an epitope of 
CED-6, hlCED-6, h2CED-6, and/or h3CED-6. 

The invention further provides a method of treating diseases the etiology of 
20 which may be attributed to failure of engulfment of apoptotic or other diseased 
cells such as inflammation autoimmune disease or cancer by administering to a 
patient one or more of the aforesaid proteins or nucleic acids or compounds which 
are enhancers of CED-6, hlCED-6, h2CED-6 or h3CED-6. 

The invention further provides a method of treating diseases which would 
25 benefit from a reduction in the engulfment of apoptotic cells, such as, 

neurodegenerative diseases, stroke, or sickle-cell anaemia, by administering one or 
more of die aforesaid proteins, nucleic acids or compounds which are inhibitors of 
CED-6, hlCED-6, h2CED-6, or h3CED-6. 

The invention further provides a method of diagnosis of a human or animal 
30 disease using a nucleic acid encoding CED-6, hlCED-6, h2CED-6 or h3CED-6 or 
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the complement thereof or an antibody to CED-6, hlCED-6, h2CED-6 or h3CED- 

6 to detect a genetic defect. 

The invention further provides a method of identifying proteins which 

interact with CED-6, hlCED-6, h2CED-6 or h3CED-6 in the signal transduction 
5 pathway in which those proteins participate. 

The invention further provides a fusion protein in which CED-6, hlCED-6, 

h2CED-6 or h3CED06 or a functional fragment thereof such as the 

phosphotyrosine binding domain or serine proline rich region, is fused to another 

protein such as an epitope tag or product of a reporter gene . 
1 0 The invention further provides a method of determining whether a 

compound is an enhancer or inhibitor of the signal transduction pathway in which 

CED-6 participates by observing the effect of the compound on C. elegans worms 

having altered CED-6 expression. 

RprPF nFs^rRTPTiQN of thf drawings 

15 Figure lA - IE are schematic representation of the CED-O Locus. Figure 

1 A Genetic map of CED-d, CED-6 and some genes close to and also used to map 
CED-6 are shown. Figure IB Cosmid rescue. Transgenic animals carrying 
cosmids or subcloned DNA fragments (see C, D) were examined for cell corpses 
on three fold embryos. Those who gave embryos with partial or no cell corpses 

20 were counted as rescuing transgenic Unes. Four out of tested thirteen cosmids are 
shown. Rescuing fragments are bold. Number represents # rescuing lines/ # lines 
tested. Figure IC Subcloning of F56D2 cosmid and rescue. Restriction map of 
the CED'6 region is shown on the top. In the middle, several restriction fragments 
were tested for their ability to rescue the engulfment defect caused by 

25 CED'6(nl813). Figure ID Subcloning of 10 kb Xho I fragment and rescue. 

Restriction map of Xho I fragment is shown on the top. In the middle mutations 
made on the Xho I fragment and their rescuing ability are shown. An X indicates a 
frameshift mutation (see Experimental Procedures for details). Figure IE 
Transcripts on Xho I fragments. Intron/exon structure of the transcripts on Xho I 
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fragment region. Boxes: exons; V symbol: introns. AAA: poly(A) tail. RT-PCR 
products of 5' end of F56D2.7 contain both SLl and SL2. 

Figures 2A and B shows that F56D2.7 Encodes CED-6. Figure 2A shows 
the full-length cDNA (SEQ ID NO: 1) and amino acid (SEQ ID NO: 2) of C. 
5 elegans CED-6. Double underline shows the nucleic acid (SEQ ID NO: 3) and 
amino acid sequence (SEQ ID NO: 4) of phosphotyrosine binding (PTB) domain; 
single underline indicates the nucleic acid (SEQ ID NO: 6) and the amino acid 
(SEQ ID NO: 7) sequence of the proline/serine rich region. Dashed underline 
indicates charged region. Star identifies the prolines in the PxxP signature 
1 0 sequence, empty triangles the charged residues within the dashed region. Shaded 
box indicates polyadenylation signal. Both SLl and SL2 could be added to 
transplicing acceptor site. The single base pair deletion identified in 
CED-6(nl813) is shown. Figure 2B Southern blot which revealed a RFLP on 4.1 
kb fragment from CED-6 (n2095). Xho I probe identifies an allele-specific RFLP 
1 5 in CED-6(n2095) that affect a 4. 1 kb Hind III fragment containing F56D2.7. On 
the right bottom the genomic fragments digested by Hind m on the Xho I fragment 
region is shown. On the right top Xho I fragment and three genes covered on this 
region. Three //ind m fragments, 4.1kb, 0.4 kb and 9.9 kb that should be lighted 
up on the Southern blot are indicated. On the left genomic DNA isolated 
20 independently from wild-type N2, CED-6(nl8l3) and CED-6(n2095) were probed 
with ^^P-labeled A^o I fragment. n2095 allele showed the missing of the 4.1 kb 
fragment and the extra 2.1 kb fragment. o.4 kb fragments were not affected in 
both alleles (data on a separate gel, not shown here). 

Figure 3A-C show that CED-6 Contains a Phosphatyrosine Binding 
25 Domain. Figure 3A shows that alignment of CED-6 PTB (SEQ ID NO: 4) witii 
other PTB domain. The PTB domain alignment was based on the NMR structure 
of She protein. Black boxes indicate identical amino acids showed by >50% of 
sequences. Grey boxes indicate similar amino acid showed by >50% of 
sequences. For this purpose, the following sets of amino acids are considered 
30 similar: G, A, C, S, T; E, D, Q, N; R, K, H; V, M, L, I; F. Y, W.a indicate the a 
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helices suggested by the NMR structure of She, and P the (3 sheats. Invariant 
residues (found in all sequences shown) are highhghted by star, "*". Figure 3B 
shows the comparison of CED-6 to other PTB domain containing proteins. Proline 
rich regions and charged regions next to PTB domains and other regions. PTB 
5 domains were compared in the percentage of identity. Figure 3C shows the 

evolution tree of the PTB domains. The alignment from (A) was displayed using 
Seqlab package in GCG program, and the evolution tree was grown graphically. 

Figure 4 shows results of the Genetic Mosaic Analysis for CED-d (table at 
bottom) and Cell lineage of C elegans (top). The descendence of both germline 

10 and somatic sheath cells are illustrated. Body wall muscles cells which were used 
to determine the loss of the duplication were also illustrated. The solid square 
indicates the duplication loss in germ cells, and the solid square indicates the 
duplication loss in the somatic sheath cells. The black arrow indicates the somatic 
sheath cell with the enlarged nucleoli in the distal arm of the anterior gonad. The 

15 white arrow indicates the cell corpses accumulated in the proximal arm of anterior 
gonad. 

Figure 5A-D provide results that showed that heat-shock overexprcssion of 
CED'6 cDNA rescued the engulfinent defect in both soma and germline. Figure 
5 A shows the cell death during the embryonic development. Shaded box is a 

20 histogragh indicating the number of dying cells every 50 minutes during die 
embryonic development. The arrows indicates the timing of heat shock and the 
timing to observe the engulfinent phenotype. Figure 5B shows the overexprcssion 
of CED'6 cDNA promotes the engulfinent at both the early and the late stage of 
cell death. Transgenic animals carrying the transgene, CED'6 cDNA driven by 

25 heat shock promoter were treated with heat before the cell death occurred at the 
indicated time. Cell Corpses in the head of young LI larvae were examined. The 
animals without the heat treatment were also examined. Other control experiments 
included iV2, CED-6(nl813) with or without heat treatment, and CED'6(nl8I3) 
carrying lacZ transgene treated with heat. The solid circles indicate the 

30 experiments with the heat shock after the formation of cell corpses, and the empty 
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circles with the heat shock before the cell death took place and the experiments 
without heat shock. Figure 5C shows the overexpression of CED-6 cDNA rescue 
the engulfinent defect in germline. The arrow indicates the timing for a heat shock 
when transgenic animals were at the development stage of the 24 hours after the 
5 L4 molt. Cell corpses were examined at the several time points between the time 
of heat shock and the 60 hours after the heat shock. Figure 5D shows the 
overexpression of CED-6 cDNA promotes the engulfinent many hours after the 
formation of the cell corpses in germline. Adult transgenic animals were treated 
with heat as indicated. Cell corpses were examined in one gonad arm 12 hours 
10 after the heat shock. Control experiments including N2, and CED'6(nl813) are 
indicated in (C), 

Figure 6 presents results that show overexpression of CED-6 partially 
suppresses the engulfment defect of both CED-1 and CED-7 during embryonic 
development CED-6 was overexpressed at the genetic background of three alleles 
15 of both CED-1 and CED-7. The timing for the heat shock and the timing for the 
examination of cell corpses are illustrated in figure 5 A. Animals with each genetic 
background were treated with heat before the cell death occurred or without the 
heat treatment. Cell corpses were examined in head of young LI larvae, LacZ 
was also expressed in the each genetic background. Each mutant was also treated 
20 with heat shock to examine the effect of heat on the expression of cell corpses. 

Figure 7 is a model of the epistatic pathway for the engulfinent genes 
overexpression of CED-6 did not have an obvious effect on the cell corpses 
expression on CED-2, 5 and 10 but on CED-l and CED-7. We propose that 
CED-6 might act downstream of both CED-1 and CED-7. And CED-2, 5 and 10 
25 either act in the different pathway or act downstream of CED-6. 

Figure 8 is a flow chart illustrating a Xho I fragment from F56 cosmid 
rescues the CED-6 engulfinent defect. 

Figure 9A-B are schematics that illustrate that the C05D2.7 construct is 
CED-6. Figure 9A shows the restriction Map of Xho I fragment and rescue. 
30 Figure 9B shows the transcripts. 
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Figure 10 is a bar graph illustrating that the over-expression of CED-6 
rescues the engulfinent defect of the CED-6 mutant. 

Figure 1 1 contains graphs illustrating that the over-expression of CED-6 
rescues the engulfinent defect of CED-6 mutant during embryonic development 
5 Figure 12 is a bar graph illustrating that CED-6 may also promote the 

engulfinent of persisting corpses. 

Figure 13 shows that CED-6 promotes the engulfinent of persistent cell 
corpses and probably acts within engulfing cells. 

Figure 14 is a schematic that shows that CED-6 may be an adaptor protein 
10 acting in signal transduction pathway. 

Figwe 15 shows graphs which indicate that over-expression of CED-6 
rescues the engulfinent defect in the adult gonad, and CED-6 might act in somatic 
sheath cells. 

Figure 16 illustrates that over-expression of CED-6 partially suppresses the 
1 5 engulfinent defect of CED- 1 mutants. 

Figure 17 shows that the over-expression of CED-6 cDNA suppresses the 
engulfinent defect of CED-7 mutants. 

Figure 18 shows consensus DNA sequence (SEQ ID NO: 7) of hi CED-6 
(2416bp) with start and stop codon in bold and alternatively spliced sequence 
20 underlined. 

Figure 19 shows DNA sequence (SEQ ID NO: 13) of h2CED-6 (alternative 
splice) with start and stop codons in bold. 

Figure 20 shows the amino acid sequence (SEQ ID NO: 8) of hi CED-6 
with alternatively spliced region underlined. 
25 Figure 21 shows the amino acid sequence (SEQ ID NO: 14) of h2CED-6 

(alternative splice). 

Figure 22 shows hlCED-6 cDNA (SEQ ID NO: 7) and hlCED-6 (SEQ ID 
NO: 8) amino acid sequence with PTB domain nucleic (SEQ ID NO: 9) and amino 
acid (SEQ ID NO: 10) sequences, charged region, and proline/serine rich nucleic 
30 acid (SEQ ID NO: 11) and amino acid (SEQ ID NO: 12) sequences indicated. 
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Figure 23 shows an alignment of CED-6 and hlCED-6. 
Figure 24 shows an alignment of regions of 47.5% and 3 1 .6% identity, 
respectively. 

Figure 25A Human Multiple Tissue Northern Blot (MTN), Figure 25B 
5 shows a Human Multiple Tissue Northern (MTN) Blot H, and Figure 25C shows a 
Human Cancer Cell Line Multiple Tissue Northem (MTN™) Blot. The expression 
pattern of hi CED-6 in normal human tissues and cancer cell lines by Northem 

blotting is shown. 

Figure 26 is a map of plasmid pGA3015 in which a CED-6 fragment is 

10 cloned as a C-terminal fusion to GFP. 

Figure 27 is a map of plasmid pGA3064 with CED-6 cloned as a C- 

terminal fusion of GFF. 

Figure 28A-28F is a DNA alignment (Genework) of sequenced hbc3123 
EST clone, the PCR fragment I isolated from a cDNA library, and three EST 
15 sequences identified using the PCR fragment, hbc3 123 EST clone was sequenced 
and analyzed. The three EST clones were identified through searching the 
Genbank using the isolated PCR fragment. 

Figure 29 shows the amino acid sequence (SEQ ID NO: 16) of the human 
h3 CED-6, as compared to hlCED-6 (SEQ ID NO: 8), 
20 Figures 30A-B show the nucleic acid sequence (SEQ ID NO: 15) that 

encodes human h3 CED-6, as compared to hi CED-6 (SEQ ID NO: 7). 

Figures 31A-B show that overexpression of h3CED'6 rescue an engulftnent 
defect. Figures 3 1 A shows overexpression of hCED-6 rescued the engulfment 
defect of CED'6(nl8J3) embryos. Embryos laid by transgenic mothers were heat- 
25 shocked before the wave of embryonic cell death, and scored for the numbers of 
persistent cell corpses in head of LI larvae. Each dot represents one animal. 
Figure 3 IB shows overexpression of hCED-6 rescued the germ cell engulfment 
defect of CED'6(nl813) animals. Transgenic animals were heat-shocked 36 hours 
after L4/adult molt, and germ cell corpses were scored 12 hours after heat shock. 
30 The number of animals scored is indicated on the top of each bar. 
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Figure 32A-J shows the nucleic acid sequence comparison among ESTs, 
CED-6, hCED-6, and a consensus construction of 2416 bp consensus sequence 
was done by using sequence infonnation obtained from EST RACE & colony 
hybridization. Seq was compiled by using aal 599394 as template and primers as 
5 indicated in multiple alignment. Rcc stands for the reverse complement. Both 
CED-6 and hCED-6 are indicated above the multiple alignment pGAlOl was 
picked up by colony hybridization. 

DETAILED DESCRIPTION OF THE INVENTION 

cDNAs encoding the alternative splice h2CED-6 and the additional 

1 0 sequence required to constitute h3CED-6 from h2CED-6 have been deposited at 
the Belgian Coordinated Collections of Microorganisms (BCCM) at Laboratorium 
voor Moleculaire Biologic - plasmidencollective (LMBP), Universiteit Gent, K.L. 
Ledeganckstraat 35, B 9000, Gent, Belgium in accordance with the Budapest 
Treaty on 8th June 1998 and have been accorded the Accession Nos LMBP 3868 

1 5 and LMBP 3869, respectively. 

Primers which will assist in obtaining the relevant inserts from these 
deposits are shown in Example 14. 
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AMINO ACID AND NUCLEOTIDE SEQUENCES 



SEQ. ID NO. 1 


Nucleic acid sequence of C. elegans CED-6. (e.g.. Figure 2A) 


SEQ. ID NO. 2 


Amino Acid sequence of C elegans CED-o (e.g., Mgurc /A) 


SEQ. ID NO. 3 


Nucleotide sequence encoding PTB domain of C elegans CED-6 
(e.g., Figure 2 A) 


SEQ. ED NO. 4 


Amino acid sequence of PTB domain of C elegans CED-6 (e.g.. 
Figure 2A) 


SEQ. ID NO. 5 


Nucleotide sequence encoding proline/serine rich region of C 
elegans CED-6 (e.g.. Figure 2 A) 


SEQ. ID NO. 6 


Amino acid sequence of nroline/serum rich region of C. elegans 
CED-6 (e.g., Figure 2 A) 


SEQ. ID NO. 7 


Nucleotide sequence that encodes hICED-o (e.g., r igure lA 
Figure 18) 


SEQ. ID NO. 8 


Ammo acid sequence of hlCED-6 (e.g.. Figure 20 ana rigure J,Z) 


SEQ. ID NO. 9 


Nucleotide sequence encoding PTB domain of hlCED-6 (e.g.. 
Figure 22) 


SEQ. ID NO. 10 


Amino acid sequence encodmg PTB domain of hlCED-6 (e.g.. 
Figure 22) 


SEQ. ID NO. 11 


Nucleic acid sequence that encodes the proline/serine rich region 
oi niv.,1-1^-0 v.e.^., nguic — y 


SEQ. ID NO. 12 


Amino acid sequence of the proline/serine rich regions of hlCED- 
6 (e.g. Figure 22) 


SEQ. ID NO. 13 


Nucleotide sequence that encodes h2CED-6 (e.g.. Figure lA) 


SEQ. ID NO. 14 


Amino acid sequence of h2CED-6 (e.g.. Figure 21) 


SEQ. ID NO. 15 


Nucleotide sequence encoding h3DEC-6 (e.g. Figure 30A-B) 


SEQ. ID NO. 16 


Amino acid sequence of h3CED-6 fe.s. Fieure 29) 
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a ELEGANS CED'6 

Programmed cell death has traditionally been divided into two distinct, 
sequential processes: cell killing, and the removal of dead cells. However, these 
two events are very closely linked. In vivo, cells that present an apoptotic 
5 morphology are usually akeady engulfed by other cells (Wyllie A. H. et al., 1980 
Int. Rev. Cytol 68, 251-306; Lockshin R.A. (1981) Cell Death in Biology and 
Pathology, R.A. Lockshin and I.D. Browen, eds. (London: Clapman and Hall), 
pp79-122; Duvall and Wyllie (1986). Immunol Today 7 pp 1 15-1 19; Robertson 
and Thompson (1982) J. Embryol. Exp. Morph. 62 pp 89-100; Hedgecock et al 

10 (1983) Science 222, 1277-1279; Ellis et al (1991) Genetics 122 PP 79-94;). 

Engulfment is also a swift and efficient process in the nematode Caenorhabditis 
elegans : dying cells are engulfed and completely removed by their neighboring 
cells within an hour (Sulston and Horvitz, (1977); Dev. Biol 56 pp 1 10-156; 
Robertson and Thomson, 1982). The engulfment is not necessarily by professional 

1 5 phagocytes. Rapid engulfment of apoptotic cells is important, as it prevents dying 
cells from releasing potentially harmftil contents during their lysis, which could 
damage surrounding tissue and result in an inflammatory response (Duvall et al., 
(1985) Immunology 56 pp 351-358; Savill et al,, (1989) J. Clin. Invest. S2 pp 
865-875; Grigg et al., (1991) Lancet 358 pp 720-722; Savill et al., (1993) 

20 Immunol. Today 14, PP 13 1-136). 

The nematode C elegans has been used extensively for the study of 
programmed cell death (reviewed by Hengartner, (1997) Cell Death in C elegans 
n. Plain View, Cold Spring Harbour Laboratory Press, pp 383-415). Genetic 
studies have identified over a dozen genes that function in the regulation and 

25 execution of apoptosis in C. elegans. Six genes - CED-1, CED-2, CED-5, CED-6, 
CED-7, and CED-10 - function in the engulfment of all dying cells (Hedgecock et 
al., 1983; Ellis et al., 1991; Horvitz et al., (1994) Cold Spring Harbour Symp. 
Quant Biol (1994) 52'. PP 377-385). In animals mutant for any one of these genes, 
many apoptotic cells fail to be engulfed and persist for many hours as highly 

30 refractile disks that can be readily identified under differential interference contrast 
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(DIG) optics (Hcdgecock et al., 1983; Ellis et al., 1991). None of the six 
engulfinent genes is absolutely essential for engulfment, as many dying cells are 
still properly removed in these mutants. Genetic analysis of various double 
mutants has suggested that these six genes might form two partially redundant 
5 groups, one being comprised of CED- 1 , CED-6, and CED-7; the other of CED-2, 
CED-5, and CED-10 (Ellis et al., 1991). The number of persistent cell corpses is 
increased dramatically in double mutants crossing groups, but not in those within 
the same group. Understanding how these genes are involved in regulating 
engulfment requires the elucidation of their molecular nature. 

10 In other species, several candidate apoptotic receptors have been identified 

over the past few years; these include the ATP -binding cassette transporter ABCl 
(Luciani and Oiimini, (1996), EMBG J. li pp 226-235) adhesion molecules such 
as the vitronectin receptor (Savill et al (1990), Nature MS pp 170-173) and CD36 
(Asch et al. (1987) J. Clin. Invest. 22 pp 1054-1061; Savill et al (1992) J, Clin, 

15 Invest. 2Q pp 1513-1522; Ren et al (1995) J. Exp. Med. 1& 1857-1862), Drosophila 
croquemort (Franc et al., (1996), Immunity 4, pp 431-443 class A scavenger 
receptors (Piatt et al., (1996), Proc. Natl. Acad. Sci. USA 21 pp 12456-12460) 
lectins (Duvall et al., (1985), and a predicted receptor that can recognize 
phosphatidylserine on the outer leaflet of apoptotic cells (Fadok et al., (1992) J. 

20 Immunol. MS pp 2207-2216; Fadok et al (1992) J. Immunol 142 pp 4029-4035). 
Currently little is known about the molecules used by engulfmg cells to transduce 
signals from surface receptors to the cytoskeleton, or how these molecules regulate 
the local cytoplasmic rearrangements and dynamic extensions that are required for 
phagocytosis (Savill et al., 1993). A genetic analysis of engulfment in C. elegans 

25 could identify genes involved in these processes. Indeed Wu and Horvitz (1998) 
(Nature 222 pp 501-504) showed that C elegans CED-5 is homologous to human 
DOCK180, and might regulate cytoskeleton rearrangement during engulfment. 

The process of apoptosis has been implicated in the etiology, or associated 
with the pathology, of a wide range of diseases, including cancer, autoimmune 

30 diseases, various neurodegenerative diseases such as Amyotrophic Lateral 



wo 99/37770 



-14- 



PCT/US99/01361 



Sclerosis, Huntington's Disease, and Alzheimer's Disease, stroke, myocardial 
heart infarct, and AIDS (Thompson, (1995) Science 267 pp 1456-1462). Thus, a 
better understanding of the molecular events that underlie apoptosis might lead to 
novel therapeutic interventions. While much of the current attention is centered on 
5 the genes and proteins that control the killing step of the death process, it is very 
likely that the removal of apoptotic cells will prove to also be crucial for the proper 
overall functioning of the apoptotic program, and will offer another entry point for 
therapeutic intervention (as described herein). 

The process of recognition and engulfment of dying cells is extremely swift 

10 and efficient. In animals, it is essentially impossible to find a cell with apoptotic 
features that is not abready within another cell. Such rapid recognition and 
phagocytosis of apoptotic ceils is a crucial aspect of programmed cell death in 
vivo: unengulfed apoptotic bodies can undergo secondary necrosis, leading to 
inflammation. Failure to remove apoptotic bodies also exposes the body to novel 

15 epitopes (from e.g., caspase-generated protein fragments), possibly encouraging 
the development of autoimmune disease. Persistent apoptotic bodies can often be 
observed following chemotherapeutic intervention (which leads to extensive 
apoptosis) and are particularly abundant in solid tumors, in which clearance of cell 
corpses might be delayed. 

20 In addition to their ability to recognize and engulf apoptotic cells, 

professional phagocytes carry specific surface receptors, such as the Fc (Ravetch, 
(1994) Cell 2S 553-560; Greenberg et al., (1993) J. Exp, Med. 177 pp 529-534) 
and C3 (Bianco et al., (1975) J. Exp. Med. 141 pp 1278-1290; Greenberg, (1995) 
Trends in Cell Biol. 1 pp 93-99) receptors, which recognize antigen-opsonized 

25 particles and trigger their phagocytosis. Inhibitor studies have shown that Fc 
receptor-mediated phagocytosis requires tyrosine phosphorylation (Greenberg et 
al., 1993; Greenberg, 1995). The work of the present inventors suggests that the 
engulfment of apoptotic cells could be also mediated by a tyrosine kinase signal 
transduction pathway. While these two pathways clearly use distinct receptors at 

30 the cell surface, they must eventually converge on the same downstream 
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engulfinent machinery, and could thus share at least some common signal 
transduction molecules. 

The invention relates to an isolated protein which is an adaptor molecule in 
a signal transduction pathway which regulates phagocytosis of apoptotic cells. 

5 In a particular embodiment, the invention pertains to an isolated protein 

from the nematode worm C. elegans which is an adaptor molecule acting in a 
signal transduction pathway which promotes phagocytosis of apoptotic cells, 
which protein comprises the amino acid sequence show in Figure 2A (SEQ ID 
No: 2) or an amino acid sequence which differs from Figure 2A only in 

1 0 conservative amino acid changes. As aforesaid the amino acid sequence shown in 
Figure 2 A is that of the C. elegans CED-6 protein with its encoding DNA also 
shown. 

In another of the aspects the invention comprises a nucleic acid comprising 
a sequence of nucleotides which encodes the amino acid sequence of Figure 2A, 
1 5 (SEQ ID No: 2) for example, a sequence of nucleotides from about nucleotide 
position 22 to about nucleotide position 1500 of Figure 2A or the entire sequence 
of nucleotides shown in Figure 2A. 

In a ftuther embodiment of the invention there is provided an isolated 
protein which is a fragment or portion of a protein having the amino acid sequence 
20 of Figure 2 A or of a protein having an amino acid sequence which differs from 
that shovra in Figure 2A only in conservative amino acid changes. For example, 
the portion may comprise an amino acid sequence corresponding to the 
phosphotyrosine binding domain (SEQ ID No: 4) (about amino acid 46 to about 
amino acid 193 in Figure 2A) or an amino acid sequence corresponding to tiie 
25 proline/serine rich region (SEQ ID No: 6) (about amino acid 242 to about amino 
acid 339 in Figure 2A). 

Nucleic acids (SEQ ID Nos: 3 and 5 respectively) encoding the PTB 
domain or the proline/serine rich region of the C. elegans CED-6 protein are 
encompassed by the claimed invention. 
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In yet a further aspect of the invention there is provided an isolated nucleic 
acid capable of hybridizing to the sequence of nucleotides of SEQ ID Nos: 1, 3, 5, 
7, 9, 11, 13, 15 under conditions of low, medium or high stringency. It is to be 
understood that low stringency means approximately: 0.2 to 2xSSC; 0.1% SDS; 
5 25° to 50*^0. 

In a further embodiment of the invention there is provided a fusion protein 
which comprises as part of the fusion a protein having an amino sequence of SEQ 
ID No: 2, 4, 6, 8, 10, 12, 14, or 16 or an amino acid sequence which differs from 
the amino acid sequence shown in SEQ ID Nos: 2, 4, 6, 8, 10, 12, 14, or 16 only in 

10 conservative amino acid changes. The protein may be fused to, for example, an 
epitope tag or the expression product of a reporter gene. 

In yet a further aspect the invention provides expression vectors comprising 
any of the nucleic acid sequences of SEQ ID Nos: 1, 3, 5, 7, 9, 1 1, 13, 15. 
Preferably, the vectors incorporate a reporter gene such as green fluorescent 

15 protein which is positioned relative to the nucleic acid of the invention such that 
expression of the nucleic acid results in expression of the reporter gene. 
Preferably, a fusion of CED-6 and the reporter gene is expressed. 

It is to be understood that the term "nucleic acid" as used herein may 
include genomic DNA, RNA and cDNA. 

20 Positional cloning methods were used to clone the C elegans CED-6 gene 

and determine the nucleotide sequence. In addition they have functionally 
characterized the protein. By searching publicly available protein sequence 
databases, it has been determined that the CED-6 protein has in the N-terminal half 
a putative phosphotyrosine binding domain and in the C-terminal half a 

25 proline/serine rich region which is a potential SH3 binding domain. 

Genetic mosaic analysis, as well as rescue and over-expression 
experiments, have shown that CED-6 acts autonomously within engulfing cells 
and promotes engulfment of apoptotic cells. Further database searching has 
confirmed the functional regions to be surprisingly evolutionally conserved. Thus, 
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the inventors have now cloned two human homologues of the C. elegans CED-6 
gene and shown them to have equivalent function. 
Molecular Cloning of C elegans CED-6 

Previous genetic mapping experiments by Ellis and Colleagues (Ellis et al, 
5 (1991) (Genetics 129 pp 79-94) have placed CED-6 gene close to the daf-4 locus 
on chromosome three (Figure 1 A). The region around daf~4 has been mostly 
sequenced by the C elegans genome sequence consortium (Wilson et al, (1994) 
Nature 368 pp 32-38), To determine the exact physical location of CED-6 ^ the 
present inventors collected thirteen overlapping cosmids in this region which 

10 together are roughly 0.3 Mbp. Using the germline transformation method (Mello 
and Fire, (1995), methods in cell biology (San Diego Academic Press) pp 452- 
482)these cosmids were tested for their ability to rescue the engulfinent defect of 
CED-6(nl813)y by scoring three-fold embryos laid by transgenic animals for the 
presence of persistent cell corpses. Three fold embryos were chosen for the initial 

15 study because cell corpses are numerous and easily seen at this stage of 

development. Two overlapping cosmids F56D2 and F43F12 were found to be 
able to rescue the engulfinent defect of CED-6(nl813), The further rescuing 
experiments using the DNA fragments from F56D2 were identified to contain the 
rescuing activity. 

20 The gene prediction program GENEFEMDER™ suggested that this region 

contains two genes, which the C elegans genome sequence consortium submitted 
to Genbank under the names F56D2.7 and C05D2.6. Using a combination of RT- 
PCR and screening of cDNA libraries (see below) the existence and predicted 
intron/exon pattern of F56D2.7 was confirmed. However, the inventors found that 

25 C05D2.6, rather than corresponding to a single gene, actually corresponds to two 
genes and the short distance (»>bp) between the end of the upstream transcript 
and the start of the downstream transcript suggested that C05D2.6A/B might be a 
two-gene operon (Zorio et al (1994) Nature 372 pp 270-272.). It was found that 
C05D2.6B is trans-spliced to the "downstream" splice leader SL2, whereas the 
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upstream transcript C05D2.6A is trans-spliced to the more common SLl splice 
lezider (Figure IE). 

The CED-6 Locus 

To determine which one of the three genes present on the Xho I fragment 
5 corresponds to CED'6, a number of constructs were generated containing internal 
deletions or point mutations. The deletion of most of the C05D2.6A/B operon had 
no deleterious effect on CED-6 rescue, whereas the introduction of a frameshift 
mutation within exon 3 of F56D2J abolished the fragment's rescuing activity 
(Figure IE). To exclude the possibility that F56D2.7 might be a multicopy 

10 suppressor of CED-6, and to confirm suspicions that F56D2.7 might correspond to 
the CED-6 locus, the two known CEIJ-d alleles, nl813 and n2095 were analysed 
for any nucleotide changes within this region. Southem blot analysis revealed an 
allele-specific restriction fragment length polymorphism affecting F56D2.7 in 
CED-6(n2095) mutants (Figure 2A). Based on the hybridization patterns observed 

1 5 in n2095, a single nucleotide deletion in exon 4 of F56D2.7 in CED-6(nl813) was 
also identified. This mutation should result in a reading frame shift and a 
truncated protein (Figure 2B). Taken together, the genomic rescue and mutation 
data strongly suggested that F56D2.7 corresponded to CED-6, 

Identification of CED-6 Transcripts 

20 To confirm the predicted intron/exon structure for CED-6, the present 

inventor screened a mixed-stage cDNA library and identified 10 clones 
corresponding the CED-6 gene. Several of these contained splice leader SL2 
sequences at the 5* end, suggesting that CED-6 might also be a downstream gene 
in an operon. RT-PCR was performed on mixed-stage RNA using both SLl and 

25 SL2 trans-splicing leaders as primers for the PGR step. Interestingly, sequence 
analysis of the PCR-amplified fragments revealed that both SLl and SL2 trans- 
splicing leaders can be found at the 5' end of CED-6 transcripts (Figure 2B). The 
upstream gene in the CED-6 operon is the predicted gene F56D2. 1 . The presence 
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of SLl -trans-spiced mRNA suggests that CED-6 might also be transcribed from a 
second downstream promoter, independently of the upstream gene. The existence 
of a downstream promoter could explain why the Xho I fragment could rescue 
CED-6 mutants even though it does not contain the whole CED-6 operon. 

5 CED-6 Protein Contains a Phospho tyrosine Binding (PTB) Domain and a 
Proline/Serine Rich Region 

The full-length CED~6 cDNA is predicted to code for a 492 amino acid 
protein (Figure 2B). A search of public sequence database with the predicted 
CED-6 sequence indicated that the N-terminal half of CED-6 contains a putative 

10 phospho-tyrosine binding (PTB) domain. PTB domains can promote binding to 
phosphorylated tyrosine residues located within an appropriate primary sequence 
context. The PTB domain is similar in function, but distinct in structure from the 
SH2 domain. The present inventors have aligned the CED-6 PTB domain with the 
PTB domains found in a number of other proteins (Figure 3A). Secondary 

15 structure prediction programs suggest that most of these structural elements also 
exist in the CED-6 PTB domain. 

In addition to its similarity to known proteins, the CED-6 PTB domain also 
showed significant sequence similarity to the predicted translation products of a 
number of expressed sequence tags (ESTs; Figure 3 A, B). In fact, the degree of 

20 similarity between CED-6 and a number of these ESTs was much higher than 
between CED-6 and any previously characteri2:ed protein (Figure 3 A, 3B). 
Furthermore, in several cases, the sequence similarity between CED-6 and ESTs 
extended beyond the PTB domain (Figure 3B). CED-6 also contains a 
proline/serine rich region at its C-terminal half, with 42% serine over a 24 amino 

25 acids stretch and clusters of proline-rich regions (Figure 2B, Figure 3B). These 
proline-rich regions were characterized by several sequence signatures of PxxP 
(Figure 2A), which has been shown to promote interaction with SH3 domains 
(Ren et al, (1993); Yu et al (1994) Cell 76 pp 933-945,; Grabs et al (1997) J. Biol, 
Chem. 272 pp 13419-13425). Between the PTB and proline-rich regions is a short 
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Stretch rich in charged residues(41% charged amino acids over 46 amino acids). 
This highly charged region is also found in several other PTB domain containing 
proteins, including mouse p96. She, and C. elegans Ml 10.5 (Figure 3B). 

Conservation of CED-6 Amongst Species 
5 It was found that these EST clones also shared the homology region 

beyond the PTB domain with the CED-6 protein. A C. Briggsae EST clone has 
72% identity to CED-6 over 132 amino acids at the N-terminus, and 64% identity 
to CED-6 over 103 amino acids at the C-terminus (Figure 3B). Three overlapping 
human EST clones were also obtained and constructed into one sequence. The 

1 0 human EST fusion sequence showed -54% identity to PTB domain of CED-6, and 
also contains a highly charged region right after the PTB domain. The evolution 
tree based on the alignment of PTB domains showed that CED-6 formed a 
subgroup with EST clones from human, Drosophila, and C, Briggsae, suggesting 
that these proteins might be functionally conserved. Mouse p96, Drosophila 

15 Disabled, and C. elegans Ml 10.5 formed another subgroup (Figure 3C). The tree 
also indicated that the She subgroup is more similar than the p96 subgroup to 
CED-6 subgroup. 

CED-6 Acts Cell-autonomously Within Engulfing Cells 

A genetic mosaic analysis was performed to determine if CED-6 acts 

20 within engulfing cells or dying cells. For convenience, a pair of cells on adult 
gonad, germ cells and somatic sheath cells (Figure 4A) were used. During 
oogenesis large number of oocj^es undergo programmed cell death, and normally 
these dying cells are engulfed by somatic sheath cells (Hengartner,1997). In this 
analysis a mosaic pattern of genetic background for CED-6 and wild type between 

25 germ cells and somatic sheath cells was generated. Ncl-I mutant was used for the 
identification of the mosaic pattern in the single-cell resolution since in the Ncl-J 
mutant somatic cells of animals exhibit abnormal enlarged nucleoli, which can be 
easily identified under Normaski optics (Herman, 1984; Genetics 108 pp 165-189; 



wo 99/37770 



-21- 



PCT/US99/01361 



Hedgecock and Hennan, 1995 Genetics 141 pp 989-1006). A strain was 
constT\xcXQddpy-17(el64) CED-6(nl813) meC'14(u55) ncl'l(el865) 
unC'36(e25 1)111: sDp3, This worm strain showed a wild type phenotype since the 
sDp3(III;J) duplication covers all these mutations (Rosenbludi et al, (1985) 

5 Genetics 109 pp 493-5 11). To identify the animals with CED-6 mutant germ cells 
and wild-type somatic sheath cells, animals must be found with the duplication 
loss from any of P2, P3 and P4 lineages but not from EMS, MS or any lineages 
below the MS which would lead to the loss of the duplication in somatic sheath 
cells (Figure 4). These animals can be obtained by looking through many animals 

10 of the constructed strain for the animals laying only Dpy Unc progenies. The 
animals with the loss of the duplication in PI lineage also lay only the Dpy Unc 
progenies, however these animals are not mosaic animals for the present purpose 
since the loss of the dupUcation in PI lineage results in the CED-6 mutant 
background in both germ ceUs and somatic sheath cells. From 1,000 dpy-1 7(el 64) 

15 CED-6(nl813) meC'14(u55) ncl-I(eI865) unC'36(e25 1)111; sDp3 animals, six 
animals were identified laying only Dpy Unc progenies. Observation of these six 
animals under Normaski optics indicated that one animal had the duplication lost 
in P4, one in P3, three in P2, and one in PI . All five animals displayed no cell 
corpses in gonad except the one with the duplication lost in PI, suggestmg that 

20 CED'6 is not required in germline for engulfment. Since the chance for loss of the 
duplication in all cell divisions is approximately the same (Hedgecock and 
Herman, 1995), the rate of the sDp3 loss is 0.15% per cell division. Animals were 
then looked for with the CED'6 mutant somatic sheath cells and wild-type germ 
cells. From 500 animals four animals were identified with enlarged nucleoli in the 

25 somatic sheath cells in one arm of the gonad (Figure 5B), and all four animals did 
not have the duplication lost in the lineage generating germ cells (Figure 4). Three 
animals appeared to have die duplication lost in sheath cells in the anterior arm but 
not in the posterior arm. And the accumulated cell corpses were only observed 
within the anterior gonad arm, but not the posterior gonad arm of these animals 

30 (Figure 4, Table). One animal had the duplication lost in the sheath cells 
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surrounding the posterior gonad arm, but not in that surrounding the anterior arm. 
This animal had cell corpses accumulated within the posterior arm but not the 
anterior arm (Figure 4). These results suggest diat CED'6 is required for somatic 
sheath cells, or engulfing cells to eliminate the dying cells in adult gonad. 

5 CED'6 Promotes the Engulfinent of Embryonic and Germ Cell Corpses 

To unambiguously demonstrate that F56D2,7 cDNA indeed corresponds to 
CED'6, the inventors tested whether die full-length F56D2.7 cDNA can rescue the 
engulfment defect oi CED'6 mutants, and transgenic animals were generated 
carrying the F56D2.7 cDNA under the control of the C elegans heat shock 

1 0 promoters hsp- 1 6.2 and hsp- 1 6.48 (see Examples) Used together, these two 
promoters drive expression in almost all somatic cells, including both cells that 
normally undergo programmed cell death and cells that normally engulf the dying 
cells. To test for rescue, embryos laid by transgenic mothers were exposed to a 
brief heat shock pulse just prior to the appearance of the first developmental cell 

15 deaths, and scored the number of persistent corpses visible in the heat-shocked 
animals after hatching (Figure 4). As expected, over-expression of F56D2.7 
cDNA significantly and specifically reduced the number of persistent cell corpses 
visible in CED'6 mutants, confirming that F56D2.7 is the relevant gene affected 
by the mutations that we detected in CED-6(nl813) and CED-6(n2095) mutants. 

20 Rescue of F56D2.7 cDNA in germhne was also tested (Figure 5C). Adult 
hermaphrodites were exposed to a brief heat shock pulse just prior to the 
appearance of the germline cell death, and scored the number of persistent cell 
corpses 12 hours and beyond after the heat shock. No cell corpses were found in 
gonads of the majority of animals, suggesting that CED'6 cDNA can also rescue 

25 the engulfment defect of CED-6 in germline. 

Recognition and engulfinent of apoptotic cells is a very early event in C 
elegans programmed cell death (Robertson and Thomson, (1982) J. Embryol. Ex. 
Morph 67 pp 89-100). In CED'6 mutants, the extension of cytoplasm is blocked, 
resulting in the persistence of cell corpses (Ellis et al, 1991). These cell corpses, 
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however disappeared jfrom the animal eventxially . To determine whether CED-6 
acts only in a narrow time-window at the early stage of cell death or whether the 
signal transduction pathway can be used to engulf cell corpses formed many hours 
after cell death takes place, the inventors tested whether F56D2,7 cDNA promotes 
5 the engulfinent of persistent cell corpses. CED-6 was over-expressed three hours 
before the embryos hatch, when most of cells dying by programmed cell death 
during the embryonic development have been dead approximately for five hours 
(Figure 5A), and examined cell corpses three hours after the heat-shock on the 
head of LI larvae. The number of cell corpses was found to be suppressed 

10 significantly (Figure 5B). The control experiments with either no heat treatment, 
or over-expression of lacZ showed no obvious effect on the corpse expression, 
suggesting that over-expression of CED-6 can promote the engulfinent of cell 
corpses in soma (Figure 5B). The inventors also tested if over-expression of 
CED-6 could promote the engulfinent of cell corpses formed hours after the cell 

15 death in the germline (Figure 5D). Adult transgenic animals carrying CED-6 
cDNA driven by the heat shock promoters were heat structured at several time 
points after the accumulation of cell corpses in gonad and the number of cell 
corpses 12 hours after the heat shock were examined. It was found that cell corpses 
could be removed sufficiently at all time points, suggesting that over-expression of 

20 CED-6 can promote the engulfinent of cell corpses accumulated in germline for 
hours, even days (Figiwe 5D). The present inventors have concluded that the 
signal transduction pathway in which CED-6 is involved can carry on the task of 
removing cell corpses, and there is no specific time-window for CED-6 to act 
during the process of programmed cell death. 

25 Mosaic CED-6 Protein Expression Supports That CED-6 Acts Within Engulfing 
Cells 

The invention includes methods to detect quickly if CED-6 acts within 
engulfing cells. This method is based on dying cells' failing to express proteins so 
as to generate a mosaic pattern of protein expression. However, this idea can be 
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only applied to the soma, but might not to the germiine, since in germline all germ 
cells share one syncytial cytoplasm (Hirsh et al, (1976) Developmental Biology 49 
pp 200-210), so those germ cells carrying the transgenes could contribute the 
expressed proteins into the cytoplasm, subsequently all newly formed oocytes. 
5 However the mosaic pattern of the protein expression can be generated in the 

germline because the transgenes have been found not to be expressed well in germ 
cells. The expression pattern of heat shock promoters in gonad were examined. 
Adult animals carrying the lacZ transgenes driven by heat shock promoter were 
applied heat shock 24 hours after L4 molts. The lacZ expression by beta-gal 

10 staining in both germ cells and sheath cells was subsquently examined. It was 
found that somatic sheath cells were stained blue and the stain could last 60 hours 
after the heat shock, but not the germline at any time point after the heat shock, the 
similar result was also observed in previous studies (Stringham et al, (1992) 
Molecular Biology of the cell 3 221-233). The expression of CED-6 in germline 

15 upon heat shock was also examined for three- fold embryos laid by heat-treated 
transgenic animals for the rescuing activity of the engulfinent defect. It was found 
that the majority of embryos had the CED-6 mutant phenotype, suggesting that 
CED-6 is not expressed well in germline. That CED-6 transgene in gonad is not 
expressed very well provided a useftil tool to test if CED-6 acts within the somatic 

20 sheath cells. As described in Figures 4 and 5, cell corpses were not observed in 
majority of animals in gonad at the different time point after the heat treatment, 
and the phenomenon lasted until 60 hours or beyond after the heat treatment (5C). 
In contrast to this result, without the heat treatment these transgenic animals had 
cell corpses accumulated in gonad, similar to that of the CED-6(nl813) mutant. 

25 Over-expression of lacZ didn't affect the expression of cell corpses of CED-6 

mutant, either (5C). These results support the conclusion from the mosaic analysis 
that CED-6 might act within engulfing cells, the somatic sheath cells. This method 
provides a simple way to detect if a gene acts within engulfing cells or dying cells. 
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Site of active of CED-6 in relation to CED-1 and CED-7 

To understand if CED'6 genetically interacts with any other engulfinent 
genes, CED-6 was over-espressed at the genetic background of CED-l. 7,2, 5, 
and 10. The extra-chromosomal arrays carrying CED'6 cDNA driven by heat 
5 shock promoters were transferred from CED-6(nl813) background to wild-type 
N2 background, and subsequently to CED-L 7, Z 5, and 10 mutant background. 
CED-6 was then over-exposed by following the method used for the rescue of 
CED'6 engulfinent defect by the over-expression of CED-6 cDNA as described in 
Figure 5 A, It was found that over-expression of CED-6 could partially suppress 

10 the engulfinent defect for CED-7 (nl 997). To understand if the suppression is 
allele-specific, two additional alleles, CED-7(nl996) and CED-7(nl892), were 
tested and similar results were achieved, suggesting that the suppression is not 
allele-specific (Figure 6). For the same purpose three alleles of CED-7, nl306, 
nl995, and nl735, were also tested it was found that over-expression of CED-6 

15 could partially suppress the engulfment defect of three alleles of CED-1 (Figure 6). 
Several control experiments were performed to confirm that these rescue were 
specific for CED-6. Transgenic animals with CED-6 transgene without heat 
treatment were tested; over-expression of lacZ at CED-l or CED-7 engulfinent 
mutant background was also tested. Results showed that the similar numbers of 

20 cell corpses were achieved as that of the CED-l or CED-7 mutants. Heat 
treatment reduced the expression of cell corpses for CED-7 (nl 997). Over- 
expression of CED-6 reduced the expression of cell corpses even more. These 
data suggest that the partial suppression of the engulfinent defect of both CED-1 
and CED-7 are specific for CED-6, It was also observed that over-expression of 

25 CED-6 did not have obvious effect on the number of cell corpses for CED-2, 5 and 
10. These results suggested that CED-6 might act downstream of both CED-1 and 
CED-7, and CED-2, 5 and 10 act either downstream of CED-1, d, and 7 or in a 
different pathway (Figure 6). 
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The Regulation of the CED-6 Expression 

SL2 was detected at the 5* end of the CED-6 cDNA, suggesting that CED-6 
is a downstream gene of an operon (Huang and Hirsh, (1989); Proc, Natl, Acad. 
ScLUSA 86 pp 8640-8644; Spieth et al (1993) Cell 73 pp 521-532; Zorio et al 
5 (1994) Nature 372 pp 270-272; Blumenthal et al (1995) TIG II pp 132-136). The 
inventors have shown previously that a 10 kb Xho I fragment can rescue the 
engulfinent defect of die CED'6 mutant. The fragment, however contains only 
CED-6, the downstream gene of an operon, but not the upstream one. The 
expression of CED'6 might rely on the 1 kb upstream region of CED-6 gene, a 
10 intergenic region of the operon. The Intergenic region of a operon sometimes 
could be used as a promoter for the expression of the downstream gene 
(Blementhal and Steward, (1997 C.elegans II) (Cold Spring Harbor; Cold Spring 
Harbor Laboratory Press pp 1 17-145) 

CED-6 is an Adaptor Molecule Acting in the Signal Transduction Pathway of the 
15 Engulfinent 

Protein phosphorylation is a well-defined "switch" mechanism for cells to 
deliver signals from one protein to another, and it is essential to transduce 
extracellular signals inside cells. PTB domain is another domain besides the SH2 
domain to be able to interact with a phosphorylated tyrosine residue (Kavanaugh 

20 and Williams, (1994) Science 266; Blaikie et al, (1994) J.Biol.Chem 269 32031- 
32034). Several proteins containing PTB domains have been found to act as 
adaptor molecules in the signal transduction pathway. These include She, Sck, 
Numb, FE65, disabled, DOC-2, P96 and IRS-1 (Bork and Margolis, (1995) Cell 
80 pp 693-694); Geer and Pawson, (1995) TIBS 20 pp 277-280). The proline rich 

25 region from many proteins have been shown to form multiproline helix and 

interact with a SH3 domain (Ren et al, 1993; Gout et al, (1993) Cell 75 pp 25-36; 
Yu et al, 1994). Both biological analysis and analysis of the crystal structure of 
the SH3 binding domain suggested that the sequence signature, PxxP, was 
essential for its interaction with the SH3 domain (Ren et al, 1993; Yu et al, 1994; 
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Grabs et al, 1997). CED-6 contained stretches of proline rich regions containing 
the PxxP signature, suggesting its potential to interact with the SH3 domain. 
CED-6 is an adaptormolecule that directly or indirectly transduces the signal from 
receptors to effectors or cytoskeleton molecules to initiate the engulfment process. 

5 The Interaction Partners of CED-6 

The PTE domain has been shown to interact specifically with a NPXY(p) 
motif (Kavanaugh and Williams, 1994; Zhou et al, (1995) Nature 378 pp 584-592; 
Geer and Pawson, 1995). Many receptors such as EGF receptor, TrkA, insulin 
receptor, IGF-1 receptor contain this motif at the carboxyl terminal (Geer and 
1 0 Pawson, 1 995). Signals from these receptors have been shown to be transduced 
through the interaction of a phosphotyrosine residue of this motif with PTE 
domains of adaptor molecules, such as She and insulin receptor substrate 1. The 
inventors found that in the intracellular region of CED-7 there was a NPXY(p) 
motif. CED-7 has been suggested to act in the same genetic pathway with CED-6 
15 (Ellis et al, 1991). The inventors have shown that CED-7 might act upstream of 
CED-6 (Figure 7). CjED-7 encodes a ABC transporter, and its mammalian 
homologue, ABCl was found to be required for the macrophage to engulf dying 
cells (Luciani and Chimini, 1996), suggesting that CED-7 might act within 
engulfing cells. It is possible for CED-6 to physically interact with CED-7 
20 through a PTE domain with NPXY(p) motif of CED-7 to regulate the signal 
transduction of engulfinent process. 

CED-6 also contains a proline/serine rich region with several sequence 
signature PxxP, which might mediate its interaction with the SH3 domain. The 
SH3 domain has been suggested to mediate protein-protein interactions between 
25 signaling molecules downstream of membrane-bound receptors (Koch et al, (1991) 
Science 252 pp 252-673; Pawson and Schlessinger, (1993) Current Biology 3 pp 
434-442. A SH3 domain containing protein is likely to interact with CED-6 and to 
regulate the signal transduction pathway of engulfment. Several proteins might 
directly or indirectly interact with CED-6 protein. CED-l might act upstream of 
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CED-(5(Figure 6 & 7A). The relationship between CED-l and CED-6 will depend 
on the cloning of the gene. A protein with a phosphorylated tyrosine residue 
should exist to interact with the PTB domain of CED-6. This phosphorylated 
protein is either a tyrosine kinase or a substrate of a tyrosine kinase, and a tyrosine 
5 phosphatase should also be involved in the signal transduction pathway of 

engulfinent to down-regulate the activity of the phosphorylated proteins. Some 
studies on phagocytosis in mammalian system have shown that a tyrosine kinase 
signal transduction pathway might play an essential role in the opsonin-mediated 
phagocytosis process (Roshenshine and Finlay, (1993) BioEssays 15 pp 17-24; 
10 Greenberg, (1995) Trends in Cell Biology 5 pp 93-99. The present results suggest 
that it might be the same case for the PCD triggered engul&nent. These two types 
of phagocytosis might share some similarity at the end. 

CED-6 Acts Within Engulfing Cells 

A genetic mosaic analysis has been performed to determine that CED'6 

15 acts within engulfing cells. This conclusion was drawn based on the observation 
of a pair of cells, germ cells and somatic sheath cells. We have shown previously 
that over-expression of CED-6 can promote the engulfinent of cell corpses. Since 
cells that have been dead for many hours are very unlikely to maintain their ability 
for protein expression (Estus, 1994; Freeman, 1994), the rescue of cell corpses is 

20 most likely to be due to the expression of CED-6 within the engulfing cells. This 
result suggests that CED'6 also acts within the engulfing cells in the soma. 
Previously it has been shown by the inventors that over-expression of CED'6 
could rescue the engulfinent defect of CED-6 in both soma and germline (Figure 
5)> suggesting that CED'6 acts in a similar mechanism in both places. 



25 CED-6 Can Promote the Engulfinent of Cell Corpses 

Over-expression of CED-6 promotes the engulfinent of dying cells at a 
very early stage of the cell death, and cell corpses formed hours after the cell 
death. Cell corpses have been shown to have a typical morphology of apoptotic 
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cells, for instance, membrane blebing. The antigens presented on the membrane 
surface of cell corpses for their recognition by engulfmg cells might be somewhat 
different from that on the membrane surface of the early dying cells. Irrespective 
of ligands on dying cells and receptors on the engulfing cells are the same or not in 
5 both situations, CED-6 is required for the engulfment. A few cell corpses in the 
gonad were not removed upon heat shock for some animals later after the heat 
shock. These corpses tend to be located in between oocytes and closed to the 
spermatheca. The failure of the engulfment of these cell corpses might be due to 
their lack of contact with the sheath cells. It is concluded that cell corpses, just 

10 like dying cells at the early stage of the PCD, can trigger phagocytosis. In mec'4 
mutant animals six touch sensory neurons die of necrotic death due to a channel 
defect leading to an impaired osmotic pressure in these ceils (Driscoll and Chalfie, 
(1991) Nature 349 pp 588-593). Chung and Driscoll showed that the removal of 
the swelling dead cells was delayed significantly at the CED-6 background, 

15 implying that CED-6 is also involved in the removal of necrotic dying cells. Thus, 
there might be similar signals presented on the surface of dead cells to allow them 
to be recognized by engulfmg cells regardless the manner of the death; and the 
signal transduction pathway in which CED-6 is involved can be used to respond to 
these signals to cause engulfment. The fact that engulfment is triggered so early 

20 and is completed so swiftly is a clever design of nature, it is important especially 
for tissues with massive cell death. 

Conservation of the Engulfment Program 

In an alignment, an EST clone from C. Briggsae is highly conserved with 
CED-6 in both the N- and C-terminal region, suggesting diat this EST clone might 
25 represent a real CED-6 homologue (Figure 3B). EST clones for Drosophila and 
human are also highly conserved to CED-6 but mainly in the region of PTE 
domain (Figure 3 A & 3B). This result suggested the possibility for these PTB 
domain proteins to be functional homologues of CED-6 in those specimens. As a 
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result two human homologues of C.elegans CED-6 gene have been cloned and 
characterized. 

Expression Vectors and Transfected Mammalian Cells Expressing CED-6 

Fragments of C.elegans CED-6 DNA was inserted into commercially 
5 available vectors, including vectors having the reporter gene, green fluorescent 
protein (GFP), are set out in table 1 below; 

TABLE 1 

GFP-CED-6 expression in MCF7 
Cloning of CED-6 fragments in pEGFP 









from .. 


. (bp) - to .... 


(b-) 






Vector 


2-1591 


22-1492 


598-1581 


598-1494 


22-745 


744-1581 


744-1494 


TA-PCR 


pGAl 


pGA2 


pGA3 


pGA4 


pGA5 


pGA6 


pGA7 


pAS2 


pGAlOll 




pGA10I3 










pGAD414 
















pEGFP-Cl(*) 


pGA3011 




pGA30I3 




pGA3015 






pEGFP-C3(*) 












pGA3036 




pEGFP-N3(*) 










pGA3045 






pEGFP-N3(*) 




pGA3062 




PGA3064 






pGA3067 



♦are commercially available from Clontech 



Visualization GFP fluorescence in MCF7 cells 
20 Human breast cancer cells, MCF7 (ATCC: HTB-22), were seeded in Lab 

Tek chambered coverglass (Nalge Nunc International) and transfected using 
lipofectAMINE (GibcoBRL). After 18 hours, the chambered coverglasses where 
placed on a inverted microscope, and GFP fluorescence could be visualized. 

Expression of GFP-CED-6 
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Subcellular localization of worm CED-6 was assayed using GFP fusion 
proteins. By using different fragments the inventors showed that CED-6 has a clear 
cytoplasmic localization. This localization was aboHshed when only the PTB of 
CED-6 was used indicating that the C-terminal part might be implicated in proper 
5 targeting. Since the actual expression level varies from cell to cell one can observe 
an ^optotic phenotype in highly expressing cells and an elevated level of 
phagocytosis in strong expressing cells. In addition, localization to the lamelli was 
observed in some cells which perform engulfment. 

The transfected MCF7 cells as above are useful for conducting assays to 
10 identify compoxmds which inhibit and enhance CED-6 or CED-6 as will be 
discussed hereafter. 

Human Homologues of C. Elegans CED-6 

In accordance with the invention there is provided an isolated protein which 
is an adaptor molecule in a signal transduction pathway which regulates 
15 phagocytosis of apoptotic cells. 

In accordance with another embodiment of the invention there is provided an 
isolated protein which is a human homologue of C.elegans CED-6 which comprises 
an amino acid sequence as shown in Figure 20 or Figure 22 (SEQ ID No: 8) or an 
amino acid sequence which differs from that shown in Figure 20 only in 
20 conservative amino acid changes (hi CED-6). 

Also provided is a nucleic acid (DNA RNA, cDNA or genomic DNA; SEQ 
ID NO: 7, 13, 15) encoding hlCED-6, h2CED-6 or h3CED-6 (SEQ ID Nos: 8, 14, 
16) or a functional equivalent thereof For example the invention encompasses a 
nucleic acid comprising the sequence of nucleotides from about nucleotide position 
25 430 to about nucleotide position 1344 shown in Figure 18, Figure 19, or Figure 22 
or the entire sequence of nucleotides shown in these figures. The invention includes 
the open reading frame of the nucleic acid sequence that encodes c. elegans CED-6, 
CED-6, h2CED-6 or h3CED-6. 

The invention also provides a protein which is a fragment of the protein with 
30 the amino acid sequence shown in Figure 20, Figure 22 or Figure 29 (SEQ ID No: 8, 
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14, 16). The fragment may comprise a sequence of amino acids corresponding to 
the phosphotyrosine binding domain of SEQ ID NOs: 8, 14, 16. For example, the 
PTB domain of SEQ ID Nos: 8 or 16 is from amino acids Nos. 15-157. The 
invention also pertains to the nucleic acid and amino acid sequences of the 
5 proline/serine rich domains of hlCED-6 and/or h3CED-6 (e.g., amino acid Nos.: 
201-276 in Figures 20, 22, or 29). Similarly, the highly charged region of SEQ ID 
NOs.: 8 or 16 is encompassed by the invention (e.g., amino acid Nos. 161-195 of 
Figures 20, 22 and 29). The invention includes the nucleic acid sequences that 
encode these fragments. 

10 There is also identified herein a splice variant of h3CED-6 (referred to herein 

as h2CED-6) which variant comprises an amino acid sequence as shown in Figure 
21 (SEQ ID No: 14) or an amino acid sequence which differs from that shown in 
Figure 21 only in conservative amino acid changes. Also provided is a nucleic acid 
(DNA, RNA, cDNA or genomic DNA) encoding h2CED-6 (SEQ ID No: 13) or a 

15 functional equivalent thereof, for example a nucleic acid comprising from about 
nucleotide position 430 to about nucleotide position 1206 in Figure 19 or the entire 
nucleotide sequence shown m Figure 19. (SEQ ID No: 13) 

The human CED-6 amino acid sequence (SEQ ID NO: 16) is also shown in 
Figure 26. Amino acid sequence SEQ ED NO: 16 (human CED-6) and SEQ ID NO: 

20 8 (hi CED-6) differ at amino acid No. 150. The nucleic acid sequence (SEQ ID NO: 
15) that encodes human CED-6 is shown in Figure 30A-B. The claimed invention 
includes SEQ ID NOs: 15 and/or 16, the open reading frame of SEQ ID NO.: 15, 
and the nucUc acid and amino acid sequence that encoded the functional fragments, 
(e.g., serine/ protein rich region, the PTB domain or the highly charged domain), as 

25 described herein. 

The invention also provides a fusion protein in which one part of the fiision 
is a protein having an amino acid sequence as shown in any of SEQ ID Nos: 8, 14 or 
16 or a sequence differing from acid sequences only in conservative amino acid 
changes. The protein may be fused with, for example, an epitope tag or expression 

30 product of a reporter gene. 



wo 99/37770 PCTAJS99/01361 

-33- 



The present invention is intended to encompass CED-6 proteins (e.g.. C. 
elegans CED-6, hi CED-6, h2 CED-6 and/or h3 CED-6) and polypeptides having 
amino acid sequences analogous to the amino acid sequences of CED-6. Such 
polypeptides are defined herein as CED-6 analogs (e.g., homologues), orthologs, or 

5 mutants or derivatives. Analogous amino acid sequences are defined herein to mean 
amino acid sequences with sufficient identity of CED-6 (e.g., C. elegans CED-6, 
hi CED-6, h2CED-6 or h3CED-6) amino acid sequence to possess the biological 
activity of CED-6. For example, an analog polypeptide can be produced with 
"silent" changes in the amino acid sequence wherein one, or more, amino acid 

10 residues differ from the amino acid residues of the CED-6, yet still possesses the 
biological activity of CED-6. Examples of such differences include additions, 
deletions or substitutions of residues of the amino acid sequence of CED-6. Also 
encompassed by the present mvention are analogous polypeptides that exhibit 
greater, or lesser, biological activity of the CED-6 proteins of the present invention. 

15 The claimed CED-6 protein and nucleic acid sequences include homologues, 

as defined herein. The homologous proteins and nucleic acid sequences can be 
detennined using methods known to those of skill in the art. Initial homology 
searches can be performed at NCBI against the GenBank (release 87.0), EMBL 
(release 39.0), dbEST SwissProt (release 30.0) databases using the BLAST network 

20 service and other EST databases. Altshul, SF, et al, Rapjp Lopal Alignment Search 
Tool . J. Mol. Biol. 215: 403 (1990), the teachings of which are incorporated herein 
by reference. Computer analysis of nucleotide sequences can be performed using 
the MOTIFS and the FindPattems subroutines of the Genetics Computing Group 
(GCG, version 8.0) software. Protein and/or nucleotide comparisons can also be 

25 performed according to Higgins and Sharp (Higgins, D.G. and P.M. Sharp, 
"Description of the method used in CLUSTAL," Gene. 73: 237-244 (1988)). 
Homologous proteins and/or nucleic acid sequences to the CED-6 protein and/or 
nucleic acid sequences tiiat encode tiie CED-6 protein are defined as those molecules 
with greater than 70% sequences identity and/or similarity (e.g., 75%, 80%, 85%, 

30 90%, or 95% homology). 
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The "biological activity" of CED-6 proteins is defined herein to mean the 
ability to regulate or affect the phagocytosis of apoptotic cells. 

The claimed CED-6 proteins also encompasses biologically active 
polypeptide firagments of the CED-6 proteins, described herein. Such fragments can 
5 include only a part of the full-length amino acid sequence of an CED-6 yet possess 
the ability to modulate or regulate phagocytosis of apoptotic cells. For example, 
polypeptide fragments comprising deletion mutants of the CED-6 proteins can be 
designed and expressed by well-known laboratory methods. Such polypeptide 
fragments can be evaluated for biological activity, as described herein. 
10 Antibodies can be raised to the CED-6 proteins and analogs, using 

techniques known to those of skill in the art. These antibodies polyclonal, 
monoclonal, chimeric, or fragments tliereof, can be used to immunoaffinity purify or 
identify CED'6 proteins contained in a mixture of proteins, using techniques well 
known to those of skill in the art. These antibodies, or antibody fragments, can also 
1 5 be used to detect the presence of CED-6 proteins and homologs in other tissues 
using standard immunochemistry methods. 

In particular, biologically active derivatives or analogs of the above 
described proteins, includmg fragments and functional domains from c. elegans 
CED-6, hi CED-6, h2CED-6, or h3CED-6, referred to herein as peptide mimetics, 
20 can be designed and produced by techniques known to those of skill in the art. (see 
e.g., U.S. Patent Nos. 4,612,132; 5,643,873 and 5,654,276, the teachings of which 
are incorporated herein by reference). These mimetics can be based, for example, on 
a specific CED-6, hlCED-6 or h2CED-6 or h3CED-6 amino acid sequence and 
maintain the relative position in space of the corresponding amino acid sequence. 
25 These peptide mimetics possess biological activity similar to the biological activity 
of the corresponding peptide compound, but possess a "biological advantage" over 
the corresponding CED-6 amino acid sequence with respect to one, or more, of the 
following properties: solubility, stability and susceptibility to hydrolysis and 
proteolysis. 

30 Methods for preparing peptide mimetics include modifying the N-terminal 

amino group, the C terminal carboxyl group, and/or changing one or more of the 
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amino linkages in the peptide to a non-amino linkage. Two or more such 
modifications can be coupled in one peptide mimetic molecule. Modifications of 
peptides to produce peptide mimetics are described in U.S. Patent Nos. 5,643,873 
. and 5,654,276, the teachings ofwhich are incorporated herein by reference. Other 
5 forms of the hi , h2, or h3 CED-6 proteins, encompassed by the claimed invention, 
include those which are "functionally equivalent." This term, as used herein, refers 
to any nucleic acid sequence and its encoded amino acid which mimics the 
biological activity of the hi, h2, or h3 CED-6 proteins and/or functional domains 
thereof. Biologically active is used to describe a protein capable of regulating the 

1 0 phagocytosis of apoptotic cells. 

A polypeptide can be in the form of a conjugate or a fusion protein, both of 
which can be made by known methods. Fusion proteins can be manufactured 
according to known methods of recombinant DNA technology. For example, fusion 
proteins can be expressed firom a nucleic acid molecule comprising sequences which 

15 code for a biologically active portion of the protein and its fusion partner, for 
example a portion of an immimoglobulin molecule. For example, some 
embodiments can be produced by the intersection of a nucleic acid encoding 
immunoglobulin sequences into a suitable expression vector, phage vector, or other 
commercially available vectors. The resulting construct can be introduced into a 

20 suitable host cell for expression. Upon expression, the fusion proteins can be 
isolated or purified from a cell by means of affinity matrix. 

Expression vectors incorporating any of the above mentioned nucleic acids 
including those designated SEQ ID Nos: 1, 3, 5, 7, 9, 1 1, 13 or 15, optionally with a 
reporter gene as aforesaid, are also provided by the invention. 

25 The present invention also encompasses isolated nucleic acid sequences 

encoding the CED-6 (e.g., C elegans CED-6, hlCED-6, h2CED-6 or h3CED-6) 
proteins described herein, and fragments of nucleic acid sequences encoding 
biologically active CED-6 proteins. Fragments of the nucleic acid sequences, 
described herein, are useful as probes. Specifically provided for in the present 

30 invention are DNA/RNA sequences encoding CED-6 proteins, the fiilly 

complementary strands of these sequences, and allelic variations thereof. Also 
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encompassed by the present invention are nucleic acid sequences, genomic DNA, 
cDNA, RNA or a combination thereof, which are substantially complementary to 
the DNA sequences encoding CED-6, and which specifically hybridize with the 
CED-6 DNA sequences imder conditions of stringency known to those of skill in the 
5 art, those conditions being sufficient to identify DNA sequences with substantial 
nucleic acid identity. As defined herein, substantially complementary means that the 
sequence need not reflect the exact sequence of the CED-6 (e.g., C. elegans CED-6, 
hi CED-6, h2CED-6 or h3 CED-6) DNA, but must be sufficiently similar in identity 
of sequence to hybridize with CED-6 DNA under stringent conditions. Conditions 
10 of stringency are described in e.g., Ausebel, F.M., et aL, Current Protocols in 

Molecular Biology, (Current Protocols, 1994). For example, non-complementary 

than CED-6 DNA, provided that the sequence has a sufficient number of bases 
complementary to CED-6 to hybridize therewith. Exemplary hybridization 
1 5 conditions are described herein. 



Cloning of human CED-6 

Following the cloning of the C.elegans CED-6 gene and the fiill sequencing 
of the open reading firame, extensive searches against public domain human 
databases were performed. These revealed statistically significant homologies to a 

20 number of ESTs at the carboxy terminal region of the protein and one EST showed 
homology to the carboxy terminal of the PTE domain and at the beginning of the 
charged region. These ESTs were used for construction of primers for 5 'RACE 
using a Marathon- ready cDNA colorectal adenocarcinoma Ubrary fi<om Clontech. 
Subsequent additional sequence analysis and rounds of database searching revealed 

25 additional ESTs which enabled construction of a concensus sequence of 

approximately 2400 bp for h3CED-6 (Figure 6). Further sequence analysis has 
revealed a splice variant of the sequence shown in Figure 18 (h2CED-6), the portion 
which is alternatively spliced being underlined. The DNA of h2CED-6 is shown in 
Figure 19 and the amino acid sequence in Figure 21. The amino acid sequence of 
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h2CED-6 is consistent with it being a dominant negative version of hi or h3 CED-6 
which antagonizes active of hi or h3CED-6. 



Assays for the identification of inhibitors and enhancers of CED-6 
hlCED-6, h2CED-6, or h3CEP-6 
5 The cloning and functional characterization of C.elegans CED-6 and its two 

human homologues have permitted assay methods to be developed which allow 
identification of compounds which might inhibit or enhance CED-6, hi CED-6, 
h2CED-6, or h3CED-6 activity or inhibit or enhance the transcription of these 
proteins. These may involve detection of the level of phagocytosis of apoptotic 

10 particles, measurement of level of actin-cytoskeleton rearrangement or detection of 
the level of transcription of the CED-6 proteins via a reporter gene such as GFP. 

An assay for the identification of inhibitors and/or enhancers of phagocytosis 
may consist of a cell line stably or transiently transfected with CED-6, hi CED-6, 
h2CED-6, or h3CED-6 or any other member of the CED-6 signal transduction 

1 5 pathway. Cell lines may also be microinjected with purified protein or vectors 
expressing antisense RNA. The expression product may be a fusion protein with 
GFP. Non transfected cells can be used in the assay also. The cell line may be a 
fibroblast cell line such as COSI, BHK 21, L929, CVl, Swiss 3T3, HT144, IMR32 
or another fibroblast cell line. The cell line may also be an epithelial cell line such as 

20 HEPG2, MDCK, MCF7, 293, Hela, A549, SW48, G361, or any other epithelial cell 
line. The cell line may a primary line, such as human dermal FIBs, dermal 
keratinocytes, leucocytes, monocytes, macrophages, or any other primary cell line. 
Cells may be double transfected with other genes (like lectin, CD 14, SRA, CD36 
ABCl, CED5, DOCK! 80) being from vertebrate (human fish, mouse) or 

25 invertebrate origin (C.elegans). 

Phagocytosis assays consist of the addition of and uptake of particles and/or 
apoptotic cells, by these cell lines. The particle may be opsonized heat or chemically 
killed bacteria and yeast in a variety of sizes, shapes and natural antigenicities. The 
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particle or cell may be an opsonized, fluorescently labeled, heat or chemically killed 
bacteria and yeast in a variety of sizes , shapes and natural antigenicities. The cell 
may be a apoptotic neutrophils, apoptotic lymphocytes, apoptotic erythrocytes or 
any other apototic cell. These apoptotic cells may be opsonized and/or labeled with 
5 dyes or fluorescent dyes. The killed bacteria or yeast cells and the apoptotic cells are 
referred to as herein apoptotic particles. 

Assay 1 

Cells, transfected with CED-6 or any other gene described herein, for 
example, nucleic acids of SEQ ID Nos: 1, 3, 7, 9, 1 1, 13, or 15, can be grown in 

10 monolayer or in suspension. The apoptotic particles are added to the transfected cell. 
Phagoci^osis can be followed by the uptake rate of the apoptotic particles. This can 
be measured by microscopy, by fluorescence microscopy, by quantitative 
spectrofluorometry and by flow cytometry. Cells and or particles may additionally 
be labeled with dyes, fluorescent dyes, antibodies and dyes of fluorescent dyes 

15 linked to antibodies prior to detection and measurement. Decrease or increase of the 
uptake of the apoptotic particles is a measurement for the influence of the 
transfected gene or genes in the phagocytosis. 



Assay 2 

Compounds can be added to assay 1 to test their influence on the genes that 
20 are involved in the phagocytosis pathway. Transiently or stably transfected cells are 
grown in suspension or in monolayer. A series of compounds is added to the cells 
prior to the addition of the apoptotic particles. The influence of the compounds can 
be measured by comparing the uptake rate of the apoptotic particles with and 
without the addition of the compound. Measurements are described in Assay 1 



25 Assay 3 

Cells are able to phagocytose apoptotic particles by engulfment of particles. 
This involves the reorganization of the actin cytoskeleton. Mammalian cells, may be 
transiently or stably transfected with CED-6 or any gene involved in the CED-6 
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phagocytosis signal transduction pathway, for example, with a nucleic acid have the 
sequence of nucleotides shown in any one of SEQ ID Nos: 1, 3, 5, 7, 9, 1 1, 13 or 15. 
Cells can be any cell as described in Assay I , The genes may be expressed as a GPF 
fusion product. Cells may be double transfected (see Assay 1). The reorganization of 
5 the actin cytoskeleton can be visualized with fluorescent dyes linked to phalloidine, 
which interacts with F-actin, Reorganization of the cytoskeleton is an measurement 
for the engulfinent induction by the transfected gene or genes. Transfected cells may 
be treated with particles or apoptotic cells as described in Assay 1. Reorganization 
of the cytoskeleton is visualized by microscopy or fluorescence microscopy. 

Assay 4 

Compounds can be added to Assay 3 to test their influence on the genes that 
are involved in the cytoskeleton reorganization related to the phagocytosis pathway 
and engulfinent. These compounds may enhance or inhibit the engulfinent or 
cytoskeleton reorganization induced by the introduced genes. Transiently or stably 
transfected cells are grown in suspension or in monolayer, A series of compoimds is 
added to the cells. The influence of the compounds can be measured by comparing 
the reorganization of actin cytoskeleton with and without the addition of the 
compound. Measurements as are described in Assay 1, Assay 2 and Assay 3. 
Apoptotic particles may be added in this test to induce phagocytosis, as described in 
Assay 2. 

Assay 5 

Non-transfected or transfected cell-lines such as those described above may 
be microinjected with purified CED-6 protein, for example, a protein having the 
amino acid sequence as shown in SEQ ED Nos: 2, 4, 6, 8, 10, 12, 14, or 16 or any 
25 protein from the CED-6 pathway or a fusion protein comprising any of said proteins. 
Microinjection can be done on the primary cell lines or the fibroblast cell lines or the 
other epithelial cells lines. The cell lines can be transfected with another gene prior 
to microinjections. Assays 1 through Assay 4 can be performed on these 
microinjected cells. 



10 



15 



20 
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Assay 6 

Transfected or non-transfected cell-lines as described above may be 
microinjected with a vector expressing CED-6 antisense RNA including antisense 
RNA in respect of any of the aforementioned proteins or any antisense RNA for 
5 genes involved in the CED-6 pathway. Microinjection can be done on the primary 
cell lines or the fibroblast cell lines or the epithelial cell lines. The cell lines can be 
transfected with another gene prior to microinjection. Assays 1 through Assay 5 can 
be performed on these microinjected cells. 

Assay 7 

10 Cell lines, as described in Assay 6 may be micro-injected with a vector 

expressing CED-6 antisense RNA or any antisense RNA for genes involved in the 
CED-6 pathway. Microinjection can be done on the macrophages. Inhibitory effects 
of the antisense RNA by inhibition of the CED-6 gene or genes involved in the 
CED-6 pathway can be followed and detected as described in Assay 1 through 

15 Assay 6. Compounds can be isolated which rescue the negative phenotype. 

Phagocytosis assays to screen for CED-6 inhibitor/enhancers in C.elegans 

The C.elegans CED-6 gene promotes the engulftnent of dying embryonic 
and germ cells and persistent cell corpses. C.elegans may be used for detection and 
isolation of compoimds that have an enhancing or inhibitory influence on 
20 phagocytosis and engulfment. In particular mutant worms lacking CED-6 activity or 
with otherwise altered CED-6 activity may be used or alternatively a transgenic 
worm transfected or transferred with CED-6, hi CED-6, h2CED-6. or h3 CED-6 
DNA may be used. 

Assay 8 

25 A series of compoimds may be applied on CED-6 mutant worms or on 

worms harboring mutations in the CED-6 pathway. Restoration of engulfinent 
induced by the compounds can be visualized using Nomarski microscopy by 
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counting cell corpses remaining in the head region of LI larvae and in the gonads of 
the worms. 

Assay 9 

A series of compounds may be applied on humanized CED-6 mutant worms. 
5 Humanized worms are worms expressing the human CED-6 gene and are mutated 
for the C.elegans gene. Human CED-6 rescues the mutant phenotype. Compounds 
inhibiting or enhancing the CED-6 phenotype can be selected by visualization of the 
engulfinent phenotype using Nomarski microscopy and looking for cell corpses as 
aforesaid. 

10 Medical applications 

The process of apoptosis has been implicated in the etiology - or associated 
with the pathology - of a wide range of diseases, including cancer, autoinmiune 
diseases, various neurodegenerative diseases such as Amyotrophic Lateral Sclerosis, 
Huntington's Disease, and Alzheimer's Disease, stroke, myocardial heart infarct, 
15 and AIDS (Thompson, 1995), Thus a better understanding of the molecular events 
that underlie apoptosis might lead to novel therapeutic interventions. While much of 
the current attention is centered on the genes and proteins that control the killing 
step of the death process, it is very likely that the removal of apoptotic cells will 
prove to also be crucial for the proper overall functioning of the apoptotic program, 
20 and will offer another entry point for therapeutic intervention. 

The process of recognition and engulfment of dying cells is extremely swift 
and efficient. In animals, it is essentially impossible to find a cell with apoptoic 
features that is not already within another cell. Such rapid recognition and 
phagocytosis of apoptotic cells is an crucial aspect of programmed cell death in vivo: 
25 unengulfed apoptotic bodies can undergo secondary necrosis, leading to 

inflammation. Failure to remove apoptotic bodies also exposes the body to novel 
epitopes (fix>m e.g., caspase-gencrated protein fragments), possibly encouraging the 
development of autoinunune disease. Persistent apoptotic bodies can often be 
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observed following chemotherapeutic intervention (which leads to extensive 
apoptosis) and are particularly abundant in solid tumors, in which clearance of cell 
corpses might be delayed. 

It is likely that failure to properly dispose of apoptoic cells leads to human 

5 disease. Genes involved in phagocytosis could therefore correspond to currently 
uncloned himian inherited disease genes. Restoring proper phagocytosis would be a 
valid therapy for certain types of inflammation and autoimmune diseases. 
Conversely, In some cases, cells that should be maintained are inappropriately 
recognized by the engulfinent machinery and cleared from the body. Preventing the 

10 engulfinent of such cells could be of great therapeutic value. Examples of such 

diseases might include neurodegenerative diseases and stroke, as well as sickle cell 
aenemia. 

In addition activation of engulfinent could be used for the same cases for 
which it is proposed to use activation of apoptosis, e.g., cancer. Indeed, specific 

1 5 activation within the cancer cells of the pro-engulfing signal would lead to the cells' 
removal - (and death) - without needing to activate the rest of the apoptotic 
machinery. This could be particularly usefiil for highly resistant tumors in which 
crucial elements of the central apoptotic machinery have akeady been inactivated. 
Thus, in accordance with another of its aspects the invention provides a 

20 method of treating, for example inflammation, autoinmiune disease and cancer by 
administering to a patient an effective amount of a substance which enhances 
phagocytosis of apoptoic cells, in particular a substance which enhances the activity 
of hl-CED6, h3-CED-6 or the signal transduction pathway in which it participates. 
Such substances includes hl-CED 6 or h3-CED-6 itself, a nucleic acid encoding hl- 

25 CED6 or h3-CED-6, an anti-sense nucleic acid to hi , h2 or h3 CED-6 or compounds 
identified in any of the aforementioned assays as enhancers of CED-6, hi -CED-6, 
h2-CED-6, or h3-CED-6 or of transcription thereof 

In addition the invention also enables a method of treatment of, for example, 
neurodegenerative diseases, stroke and sickle-cell anaemia by administering to a 

30 patient an effective amount of a substance which inhibits phagocytosis of apoptotic 
cells, in particular a substance which inhibits the activity of hl-CED6 or h3-CED6 
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or the signal transduction pathway in which it participates. Such substances include 
h2 CED-6, a nucleic acid encoding h2CED-6, an anti-sense nucleic acid to hlCED-6 
or h3CED-6 or compounds identified in any of the aforementioned assays as 
inhibitors of CED-6 or hi CED-6 or h3 CED-6 or of transcription thereof. 
5 Pharmaceutical compositions comprising any of the above-mentioned 

therapeutic substances and a pharmaceutically acceptable carrier are also envisaged 
by the invention. 

To accomplish the various therapeutic treatments as described herein, a 
nucleic acid which encodes hi, h2 or h3 CED-6 or a functional portion or domain 

10 thereof must be introduced into a mammalian cell (e.g., mammalian somatic cell, 
mammalian germ Une cell (sperm and egg cells)). This can be accomphshed by 
inserting the isolated nucleic acid that encodes either the full length protein, or the 
domains described herein, or a functional equivalent thereof, into a nucleic acid 
vector, e.g., a DNA vector such as a plasmid, virus or other suitable replicon (e.g., a 

1 5 viral vector), which can be present in a single copy or multiple copies. The nucleic 
acid may be transfected or transformed into cells using suitable methods known in 
the art such as electroporation, microinjection, infection, and lipoinfection and direct 
uptake. Such methods are described in more detail, for example, in Sambrook et aL, 
"Molecular Cloning: A Laboratory Manual," 2nd ED. (1989), Ausubel, P.M., ei al, 

20 Current Protocols in Molecular Biology, (Current Protocol, 1994) and Sambrook et 
al, "Molecular Cloning: A Laboratory Manual," 2nd ED. (1989). 

hi, h2 or h3 CED-6 can be delivered to a cell by the use of viral vectors 
comprising one or more nucleic acid sequences encoding those proteins. Generally, 
the nucleic acid sequence has been incorporated into the genome of the viral vector, 

25 In vitro, the viral vector containing hi, h2 or h3 CED-6 protein described herein or 
nucleic acid sequences encoding the protein can be contacted with a cell and 
infectivity can occur. The cell can then be used experimentally to study 
phagocytosis of apoptotic cells or for assays as aforesaid or be implanted into a 
patient for therapeutic use. The cell can be migratory, such as hematopoietic cells, 

30 or non-migratory such as a solid tumor or fibroblast. The cell can be present in a 
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biological sample obtained from the patient (e.g., blood, bone marrow) and used in 
the treatment of disease, or can be obtained from cell culture. 

After contact with the viral vector comprising the hi, h2 or h3 CED-6 
protein or a nucleic acid sequence encoding them, the sample can be returned or 
5 readministered to a cell culture or patient according to methods known to those 
practiced in the art. In the case of delivery to a patient or experimental animal 
model (e.g., rat, mouse, monkey, chimpanzee), such a treatment procedure is 
sometimes referred to as ex vivo treatment or therapy. Frequently, the cell is 
targeted from the patient or animal and remmed to the patient or animal once 

10 contacted with the viral vector comprising the activated mutant of the present 

invention. Ex vivo gene therapy has been described, for example, in Kasid, et al, 
Proa Natl Acad, Sci. USA 57:473 (1990); Rosenberg, etaL, Neyj Engl J, Med. 
323:570 (1990); Williams, et al. Nature 310476 (1984); Dick, et al. Cell 42:71 
(1985); Keller, et al. Nature 575: 149 (1985) and Anderson, et al, U.S. Patent No. 

15 5,399,346(1994). 

Where a cell is contacted In vitro, the cell incorporating the viral vector 
comprising a nucleic acid sequence of hi CED-6, h2 CED-6 or h3CED-6 can be 
implanted into a patient or experimental animal model for delivery or used in In 
vitro experimentation to study cellular events mediated by hi, h2 or h3 CED-6. 

20 Various viral vectors can be used to introduce the nucleic acid into 

mammalian cell. Viral vectors include retrovirus, adenovirus, parvovirus (e.g., 
adeno-associated viruses), coronavirus, negative strand RNA viruses such as 
orthomyxovirus (e.g., influenza virus), rhabdovfrus (e.g., rabies and vesicular 
stomatitis virus), paramyxovirus (e.g. measles and Sendai), positive strand RNA 

25 viruses such as picomavirus and alphavirus, and double stranded DNA viruses 
including adenovirus, herpesvirus (e.g.. Herpes Simplex virus types 1 and 2, 
Epstein-Barr virus, cytomegalovirus), and poxvirus (e.g., vaccinia, fowlpox and 
canarypox). Other viruses include Norwalk virus, togavirus, flavivirus, reoviruses, 
papovavirus, hepadnavirus, and hepatitis vims, for example. Examples of 

30 retroviruses include: avian leukosis-sarcoma, mammalian C-type, B-type viruses, D- 
type viruses, HTLV-BLV group, lentivirus, spumavirus (CofBn, J.M., Retroviridae: 
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The viruses and their replication. In Fundamental Virology, Third Edition, B.N. 
Fields, et al, Eds., Lippincott-Raven Publishers, Philadelphia, 1996). Other 
examples include murine leukemia viruses, murine sarcoma viruses, mouse 
mammaiy tumor virus, bovine leukemia virus, feline leukemia virus, feline sarcoma 

5 virus, avian leukemia virus, human T-cell leukemia virus, baboon endogenous virus. 
Gibbon ape leukemia virus. Mason Pfizer monkey virus, simian immunodeficiency 
virus, simian sarcoma virus, Rous sarcoma virus, lentiviruses and baculoviruses. 

A preferred method to introduce nucleic acid that encodes hi, h2 or h3 CED- 
6 into cells is through the use of engineered viral vectors. These vectors provide a 

10 means to introduce nucleic acids into cycling and quiescent cells, and have been 
modified to reduce cytotoxicity and to improve genetic stabiUty. The preparation 
and use of engineered Herpes simplex virus type 1 (D.M. Krisky, et al.. Gene 
Therapy 4(10): 1 120-1 125. (1997)), adenoviral (A, Amalfitanl, et al.. Journal of 
Virology 72(2):926-933. (1998)), attenuated lentiviral (R. ZufFerey, et al., Nature 

15 Biotechnology 15(9)871-875 (1997)) and adenoviral/retroviral chimeric (M. Feng, et 
al.. Nature Biotechnology 15(9):866-870 (1997)) vectors are known to the skiUed 
artisan. 

Hence, the claimed invention encompasses various therapeutic uses as 

aforesaid for the hi, h2 or h3 CED-6 protein or nucleic acid. 
20 The protein may be administered using methods known in the art. For 

example, the mode of administration is preferably at the location of the target cells. 

As such, the administration can be nasally (as in administering a vector expressing 

ADA) or by injection (as in administering a vector expressing a suicide gene tumor). 

Other modes of administration (parenteral, mucosal, systemic, implant, 
25 intraperitoneal, etc.) are generally known in the art. The agents can, preferably, be 

administered in a phannaceutically acceptable carrier, such as saline, sterile water, 

Ringer's solution, and isotonic sodium chloride solution. 

The invention also provides diagnostic reagents which may be used in the 

diagnosis of a disease associated with a defect in phagocytosis of apoptotic cells. 
30 For example, an antibody to an epitope of any of the proteins with an amino acid 

sequence as shown in SEQ ID Nos: 2, 4, 6, 8, 10, 12, 14 or 16 could be used as a 



wo 99/37770 



PCTAJS99/01361 



-46- 

diagnostic reagent to determine whether a patient has a defect in hlCED-6, h2CED- 
6 or h3CED-6 or in the expression thereof. In addition defects at the genetic level 
can be detected by using as a probe a nucleic acid having a sequence as shown in 
SEQ ID Nos: 1, 3, 5, 7, 9, 1 1, 13, or 15 or portions thereof. 

5 Identification of the other proteins active in the CED-6 signal transduction pathway 
CED-6, hlCED-6, h2CED-6 or h3CED-6 can be used to identify other 
members of the signal transduction pathway promoting phagocytosis of apoptotic 
cells. There are number of possible methods by which this can be done but a 
preferred method is the so-called *two hybrid" system developed in yeast by Chien 
10 et al (1994, Proc. Natl. Acad Sci. USA 88 pp 9578-9582) which allows 
identification of proteins which bind to a particular protein of interest. 

This technique is based on functional in vivo reconstruction of a transcription 
factor which activates a reporter gene. More particularly the technique comprises 
providing an appropriate host cell, preferably yeast, with a DNA construct 

1 5 comprising a reporter gene under the control of a promoter regulated by a 
transcription factor having a DNA binding domain and an activating domain, 
expressing in the host cell a first hybrid DNA sequence encoding a first fusion of a 
fi-agment or all of a nucleic acid sequence according to the invention and either said 
DNA binding domain or the activating domain of the transcription factor, expressmg 

20 in the host cell at least one second hybrid DNA sequence encoding putative binding 
proteins to be investigated together with the DNA binding domain or activating 
domain of the transcription factor which is not incorporated in the first fusion; 
detecting any binding of the protein being investigated with a protein according to 
the invention by detecting for the production of any reporter gene product in the host 

25 cell; optionally isolating second hybrid DNA sequence encoding the binding protein. 



EXAMPLES 

The N2 Bristol strain was used as the reference wild-type strain for this 
study. All strains were maintained as described by Brenner (Brenner, 1974), except 
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that worms were raised on NGM-lite agar medium. Strains were maintained and 
raised at 20°C, unless otherwise noted. The following mutations were used In this 
study: LG I: ced'l(el735), ced'l(nl995) and ced-l(nl506) (Ellis et al, 1991); LG 
III: dpy-17(el64), ced-6(nl813, n2095l mec'14(u55), ncl-l(el865) ced-7(nl997), 
5 ced'7(nl892), ced-7(nl996) (Ellis et al, 1991), unc-36(e251) (Brenner, 1974) and 
sDp3(IIlfi (Rosenbluth et al, 1985); on LG IV: ced'2(el7S2) (Hedgecock et al, 
1983), ced-5(nl812) and ced-10(nl993) (Ellis et al, 1991). All mutations are 
described in Hodgkin (1997). 

EXAMPLE 1 

10 Analysis and Quantifying of Engulfinent 

Animals were anesthetized with 3GrriM NaN3 and mounted on agar pads to 
observation using Normarski optics microscope (Sulston & Horvitz, 1977; Avery 
and Horvitz, 1987). To quantify engulfinent of cell corpses generated during 
embryonic development, the number of persistent ceil corpses that were visible in 

15 the head region of young LI larvae that still had only four cells in gonad (i.e., had 
hatched in the previous four hours) were scored. To quantify the germ line 
engulfinent defect, cell corpses visible within both the distal arm (where the germ 
cell deaths occur) and the proximal arm (where persistent germ cell corpses can 
sometimes be observed as they are swept along by the developing oocytes) were 

20 counted. 

EXAMPLE 2 

Gennlme Transformation and Genomic Rescue ofced-6 

Transgenic animals were generated using the germline microinjection 
procedure developed by Mello et al. Cosmids W03A5, F20F10, F48E8, R02F2, 
25 W02G12, T06H6, C48E6, C44D7. F56D2, F43F12, C05D2, T06C9, C05H8 were 
injected, either singly or in groups (final concentration 20ng/ul for each cosmid), 
into ced-6(nl813) animals. Plasmid pRF4 was used (final concentration 50-80 
ng/ul) as the dominant co-injection marker (Mello et al., 1991); pRF4 carries the 
mutated collagen gene rol-6(suJ006gf) and confers a dominant roller (Rol) 
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phenotype. Transgenic lines carrying stably transmitting extrachromosomal arrays 
were kept for further analysis. To assay for rescue, three-fold embryos laid by 
transgenic animals were examined for cell corpses under Normaski optics. 
Transgenic lines that generated embryos with fewer or no corpses were considered 
5 to be rescued. To further define the position of ced-6 within F56D2, a number of 
deletion constructs were created and other fragments subcloned into pBluescript 
SK(+) n. 50-90 ng/ul of these clones were co-injected with 80-100 ng/ul pRF4 
injection marker into ced-6(nJ813) worms, and their rescuing ability tested as 
described above. 

10 EXAMPLE 3 

Isolation oiced-u cDNAs 

To isolate full-length ced-6 cDNAs, a mixed-stage Celegans lambda Zap 
cDNA library was screened (gift of R. Barstead, Oklahoma Medical Research 
Foundation, Oklahoma City, OK) using established protocols (Sambrook et al., 

15 1 989). ^^P-labeled probe was made usmg the rescuing 10 kb Xho I genomic 

fragment as template. Positive phage were transformed into plasmid clones using 
the in vivo excision protocol. The clones representing F56D2.7 gene from isolated 
plasmid clones were identified on a Southern blot. For this purpose a ^^P-labeled 
probe was generated from RT-PCR product, which represents three exons of 

20 predicted F56D2.7. Primers used for RT-PCR: GAATGTTCTCATTTATTG (SEQ 
ID NO.: 29) and GGATTCAAACGATCCGATG. (SEQ ID NO: 17) 

From about 300,000 plaques 10 plasmid clones corresponding F56D2.7 
cDNAs were isolated. These clones were sequenced for both ends of the insert 
using the flanking T3 and T7 primers. Two clones with partial SL2 sequence at the 

25 5' end and intact poly{A) tail were identified as full-length F56D2.7 cDNAs. 

Analysis of these sequence results and the pattern of restriction digestion by Sau3A 
I also suggested that these clones represent for one transcript, 

EXAMPLE 4 

Reverse transcription-PCR 
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Reverse transcription (RT)-PCR experiments were performed to determine 
the 5'end of transcripts detected or predicted within the rescuing Xho I genomic 
fragment. Reverse transcription was performed with following primers: C05D2,6a: 
GAATCTGTCCATCGCATTGC(SEQIDNO.: 18), 
5 GAATTTCTTTGGGTAGACA (SEQ ID NO.: 19); C05D2.6b: 

GCTCTGAAGAACTGTGA (SEQ ID NO.: 20), GACGAGGTGAAGCGATTGTG 
(SEQ ID NO.: 21); F56D2.7: GGGATCAAACGAATCATC (SEQ ID NO.: 22). 
These primers were then used in combination with SLl 
(GTTTAATTACCCAAGTTTGAG (SEQ ID NO.: 23)) or SL2 
1 0 (GGTTTT AACCC AGTTACTC AAG (SEQ ID NO. : 24)) primers for subsequent 
PGR amplification. Total C elegans mixed stage RNA was isolated as described 
previously. RT-PCR was performed using the Superscript Preamplication System 
(Gibco BRL). 

EXAMPLES 

1 5 Identification of ced-6 Mutations 

To determine whether either ced-6 allele resulted in a large physically 
detectable polymorphism, we generated Southern blots of N2, ced-6(nl813), and 
ced-6(n2095) genomic DNA digested with various restriction enzymes. A probe 
generated from the rescuing Xho I genomic fragment detected noval allele-specific 

20 bands in ced'6(n2095) using four different restriction enzymes. Analysis of the 

novel restriction patterns in ced-6(n2095) indicates that this allele carries a complex 
rearrangement in this region, that covers at least part of F56D2.7, but does not affect 
the neighboring C05D2.6b transcript. 

To identify point mutations within F56D2.7, overlapping fi^gments of the 

25 F56D2.7 locus from N2, ced-6(nl813), and ced-6(n2095) mutants were PGR 
amplified and directly sequenced using the PGR Product Sequencing Kit 
(Amersham). The overlapping PGR fragments covered the entire F56D2.7 
transcription unit and about 1 kb of upstream genomic sequence. Sequences of the 
primers used for PGR amplification and sequencing are available upon request. 
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EXAMPLE 6 

Heat Shock Experiments 

To test whether ced-6 cDNA can rescue the engulfinent defect, Kpn USal I 

fragment of full-length F56D2.7 cDNA was inserted in Kpn USac I site of MCS H of 
5 both pPD49.78 and pPD49.83 vectors which carry hspl6-2 and hspl6-41 promoters, 

creating the constructs pLQhsl and pLQhs2. The two constructs were co-injected, at 

50ng/ul each with 80ng/ul pRF4, to generated stably transmitting extrachromosomal 

arrays. For our control experiments, we used pPD50.21 and pPD50.15, two 

derivatives of pPD49.78 and pPD49.83 in which the lacZ open reading frame has 
10 been placed under heat shock promoters. Transgenic lines carrying these constructs 

were generated as described above. 

To overexpress ced-d before cell death occurs during embryonic 

development, adult animals were put on a plate seeded with E.coli and allowed to 

lay eggs for one hour. Plates were subsequently parafilmed and subjected to heat 
15 shock by transfer to 33°C waterbath for 45 minutes. Following a 75-minute recovery 

at 20°C, adult animals were removed from the plates. 12-14 hours after heatshock, 

hatching LI larvae were scored for corpses in the head region. 

To overexpress ced-6 after the formation of cell corpses during embryonic 

development, worm plates containing embryos at all developmental stages (but not 
20 larvae) were parafilmed and subjected to heat shock in a 33°C waterbath for 45 

minutes. Three hours after the heat shock, freshly hatched LI larvae were scored for 

corpses in the head region. 

To determine the effect of ced-6 overexpression before cell death occurs on 

the engulfinent of dying germ cells, L4 stage transgenic animals were transferred to 
25 new plates and stored at 20**C. Starting 24 hours after the L4 molt, the worm plates 

were parafilmed and heat shocked for 45 minutes at 33*^0 as described above. 

Animals were examined for germ cell corpses at 12 hours after heat shock, also 18, 

24, 36, and 60 hours after heat shock. 

To overexpress ced'6 after the formation of germ cell corpses, L4 stage 
30 transgenic animals were collected and put into several plates, a few for each plate. 

24 hours after the L4 molt one plate of worms were heat shocked for 45 minutes as 
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described above. Similarly, 36, 42, 48 and 60 hours after the L4 molt, each plate of 
worms at one time point were treated with heat. Animals were examined for germ 
cell corpses 12 hours after heat shock. 

To overexpress ced-6 in the background of other engulfinent mutants, the 

5 ced-6 or lacZ-expressing extrachromosomal arrays were transferred from 

ced-6(nl 813) to a wild-tj^je background, and crossed subsequently to ced-l(el 735) 
ced-l(nl506). ced-l(nl995). ced-7(1892), ced-7(nl996). ced-7(nl997). 
ced-2(nl752). ced-5(nl812) or ced-J0(nl993) to generate the corresponding 
transgenic mutant strains. Heat shock experiments were perfonned as described 

10 above. 



EXA1VEPLE7 

Genetic Mosaic Analysis 

1000 dpy-17(el64) ced-6(nI813) mec-14(u55) ncl-l(el865) unc-36(e25) HI; 
sDp3(IUJ) were put in worm plates individually. The progenies of these animals 

1 5 were examined to identify animals who laid only DP Y UNC progenies under the 
dissecting microscope. The adult animals were examined under the Normaski Optics 
immediately after being identified. First the somatic sheath cells were examined, 
then the body wall muscle descended from D and C lineages. When all body wall 
muscle cells displayed wild-type, the duplication is lost in P4 lineage. When body 

20 wall muscle cells from D lineage are wild-type, while those from C lineage exhibit 
ncl phenotype, the dupUcation must be lost from P3 lineage. When body wall 
muscle cells from both D and C lineages show the ncl phenotype, the dupUcation 
must be lost from P2 lineage. The cell corpse in both arms of gonad were also 
examined for the engulfinent phenotype. To fmd the animals with the duplication 

25 lost in the somatic sheath cells, but not in gemi cells, dpy-1 7(el 64) ced-6(nl813) 

mec-14(u55) ncl-l(el865) unc-36(e25) III; sDp3(IIIJ) animals were examined under 
the Normaski Optics for the loss of the duplication in somatic sheath cells. At the 
same time cell corpses in gonad were also examined for the engulfinent phenotype. 



EXAMPLES 
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Identification of a human homologue of CED-6 

Extensive searches (tblastn) with the ced-6 sequence (Figure 18 Consensus 
DNA Sequence of hCED-6) against the public domain databases (EST, Genbank, 
EMBL, Swissprot and PIR) revealed statistically significant homologies to some 

5 ESTS at the carboxyterminal region of the protein (AA443368, AA431995, 

R33389,R53881). One EST (T48513) showed homology to the Carboxyterminal of 
the PTB domain and the beginning of the charged region. For 5' RACE analyses a 
Marathon-ready cDNA colorectal adenocarcinoma, library was used from Clontech. 
The position of the primers used for RACE and sequencencing is indicated in figure 

10 18. By subsequent cloning and sequence analysis additional sequence information 
was obtained. Using this additional sequence information and subsequent rounds of 
database searching (blastn) revealed additional EST, which enabled us to construct a 
consensus of approx 2400 hp. This sequence was fiirther extended and verified by 
colony hybridization and sequencing additional RACE products. 

15 EXAMPLE 9 

RNA Blots (see Figure 25 expression pattern of hCED-6 in normal human 
tissues and cancer cell Unes by Northem blotting A) Human Multiple Tissue 
Northem (MTN) Blot B) Human Multiple Tissue Nortiiem (MTN) Blot U C) 
Human Cancer Cell Line Multiple Tissue Northem (MTN™) Blot) 

20 A Human multiple tissue Northem (MTN- 1 , Clontech) contaimng in each 

lane 2 mg of poly A + RNA firom eight different human tissues (heart, brain, 
placenta, lung, liver skeletal muscle, kidney, and pancreas) and a MTN-H human 
multiple tissue Northern, containing in each lane 2 mg of poly A + RNA firom 
spleen, thymus, prostate, testis, ovary, small intestine, colon and peripheral 

25 leukocyte, were hybridized according to the manufacturer's instructions and washed 
out in 0.1 X SSC, 0.2% SDS at 55°C. Also firom Clontech, a poly A + RNA blot firom 
human cancer cell Unes (melanoma G361, lung carcinoma A549, colorectal 
adenocarcinoma SW480, Burkitf s lymphoma Raji Leukemia Molt 4, lympohoblastic 
leukemia K562, HeLa S3 and promyelocytic leukemia HL60) was tested. 
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EXAMPLE 10 

Isolation of the fiill-Iength human ced-6 cDNA. 

Several human EST clones including hbc3123 have been identified through 
searching variety of database. The hbc3123 EST clone was completely sequenced. 
5 One pair of primers, P (ACAATTGCCAGCTTCATAG; SEQ ID NO.: 30) and Q 
(CTGTTTTCTTGTTTCAACATC; SEQ ID NO.: 31) have been designed on the 
region of PTB domain and subsequently tested for their specificity using human 
genomic DNA as a template. The result showed that the primers are specific. One 
set of AgtlO cDNA libraries (purchased from Clontech) including Brain, Heart, 

10 Kidney, Liver, Lung, Pancreas, Placenta, Skeletal Muscle tissues were tested using 
primers P and O to detect whether ced-6 is expressed in any of these tissues. 

The primer Q and a primer against A.gtlO vector were used to isolate several 
PGR fragments using brain and pancreas cDNA Ubraries. These PGR fi-agments 
were reamplified using the same primer set and sequenced. The sequence analysis 

15 suggested that these PGR fragments allows the extension of cDNA 130bp upstream 
of the initiation codon of human ced-6 coding region. The longest PGR fragment 
was then sent to human EST database to search for more EST clones which have 
overlap with the isolated PGR fragments but not the hbc3123 EST clone. The 
Genbank names of these three EST clones are R65982, R65983 and AA159394, 

20 respectively. These 3 ESTs together with the PGR fragment and hbc3 123 constitute 
the full-length coding sequence of human GED-6 and about 450 bp of 5TJTR. The 
human ced-6 cDNA sequenced is confirmed correctly by the sequencing data of 
hbc3123 EST clone, the sequencing data of the isolated PGR fragments and die 
sequence data of the many EST clones on the human cDNA region from human EST 

25 project. These human ced-6 cDNA data have suggested and guided any experiments 
shown in both Example 8 and Example 9. See Figure 32. 
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EXAMPLEll 

Human Tissue Distribution of Human CED-6 

This is a further example of the htunan tissue distribution. Two primers 
against the PTB domain were used to detect whether the cDNA libraries contained 
5 human ced-6. The two primers have been tested using human genomic DNA as a 
template and they are specific since no background amplification was detected. The 
result of this tissue distribution study is as follows: 
1. Information obtained fi-om cDNA library 

Tissue Presence of human ced-6 cDNA 

10 Brain ++ 

Heart ++ 
Kidney ++ 
Liver + 
Lung -H- 
15 Pancreas ++ 

Placenta ++ 
Skeletal muscle ++ 
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n. Information obtained from human EST project 



Tissue EST clones from 

human EST project 

Brain 2 

5 Testis 3 

Pancreas 4 

HCC ceU line 1 

Aorta 1 

Placenta 13 

10 Fetus 1 

Pooled sample 2 



EXAMPLE 12 

The technique known as FISH was carried out, the human ced-6 gene was 
localized to chomosomal position 2q32.3-q33. 

15 EXAMPLE 13 

Functional conservation between C elegans and human ced-6 homologues; 
overexpression of hCED Rescues the Engulfinent Defect of CED-6 Mutants in c. 
elegans: 

Given that signal transduction pathways are usually conserved through 
20 evolution, it is thought that the human ced'6 homologue (hereafter referred to as 
hced'6 which encompasses hi CED-6 and/or h3CED-6) might also be involved in 
promoting the phagocytic removal of apoptotic cells in mammals. To address this 
question, we tested the himian and worm ced-6 genes for functional conservation by 
overexpressing hced-6 in C elegans and determining whether it could functionally 
25 substitute for the endogenous ced-6 gene. 
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It is shown herein that overexpression of a C elegans ced-6 cDNA under the 
control of the heat shock promoters hspl6'2 and hsp 16-41 efficiently rescues the 
engulfinent defect in transgenic ced-6 mutant embryos. The same assay was used to 
test hced'6 for biological activity in C. elegans: constructs were created carrying the 
5 hced'^6 open reading frame under the control oi hsp 16-2 and hsp 16-41^ and ced- 
6(nl813) mutant animals transgenic for both constructs were tested for rescue of the 
engulfinent defect in late embryos and young larvae. It was found that heat-shocked 
embryos laid by transgenic mothers, but not non-heat shocked embryos, contained 
few cell corpses (Figure 31 A). These observations suggest that hced-6 can 
10 substitute, albeit weakly in the current assay, for C elegans ced-6 ^ supporting the 
concept that C elegans and human ced'6 are functionally conserved. Further 
assessment as shown in Example 13, showed successful rescue. 

Partial rescue, or even absence of rescue in certain assays, has been observed 
previously, even in cases where functional conservation has been estabUshed. For 
15 example, Wu and Horvitz (1998a) Nature 1998a 322 501-504, have found that 

DOCK180, the mammahan homologue of C elegans CED-5, efficiently rescued the 
distal tip cell migration defect of CED-5 mutants, but not the engulfinent defect. 



Experimental Procedures 

The open reading frame of hced-6 was PCR-amphfied using oligonucleotides 
20 flanking the start and stop codons, and subcloned into the heat shock vectors 

pPD49.78 and pPD49.83, previously digested with Kpn I and Sac I (see before). The 
two constructs were then injected into ced-6(nl813) animals as described previously 
to establish stably transmitting transgenic lines. 

To score for rescue of the engulfinent defect in embryos and in the adult germ 
25 line, transgenic animals were submitted to heat-shock and the number of cell corpses 
quantified as described previously herein. 
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Table 2 

Overexpression of human ced'6 homologue reduces the number of persistent 



cell corpses in ced'6(nl813) late embryos. 


Genotype 


Persistent cell corpses 


- heatshock 


+ heatshock 


Wild Type (N2) 




ced-6(nl813) +++ 


++ 


ced-6(nl813): hs::hced-6 


4- 



One of the isolated PGR fragments was fused to the hbc3123 EST clone. 
pLQhced-6.1, the fusion cDNA, has 130 nucleotides upstream of the initiation codon 

10 ATG. Two primers, Hhsl (GGGGTACCGAATTCTGATGGCAAC; (SEQ ID 
NO.:27)) and Hhs3 (CGAGCTCGATCAATAGTGAAGGTGAGG; (SEQ ID NO.: 
28)) were used to amplify the open reading frame of human ced-6 cDNA. The PGR 
fragment was digested subsequently with Kpn I and Sac I, and inserted into Kpn I 
and Sac I sites of both ppD49.78 and ppD49.83 heat shock vectors. The heat shock 

1 5 constructs, pLQhsl and pLQhs2, 50 ng/jil for each, whre then co-injected with a 
marker pRF4 (80 ng/jil) into the germline of adult ced-6(hl813) hermaphrodites. 
nced-6 was examined for its abiUty to rescue the engulfrnent defect in embryo 
progeny of ced-6(nl813) transgenic animals following an established procedure, as 
described herein. 

20 The rescuing abiUty of hGED-6 for the engulfrnent defect of ced-6(nl 813) in 

the adult gonas was also tested. Transgenic animals at L4/aduh molt were picked 
and put on a fresh plate. 36 hours later these animals were treated with a 45 minute 
heat shock at 33 "G. Twelve hours after the heat shock, cell corpses were scored in 
one gonad arm. Gontrol experiments, such as transgenic animals withut heat 

25 treatment, ced-6(nl813) animals at the same development stage with or without heat 
shock, were also used. These experiments show that overexpression of hced-6 
rescued the engulfrnent defect of GED-6 mutants in C. elegans in a germ line. These 
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experiments confirm that human ced-6's (e.g., h3CED-6) function mduces the 
phagocytosis of £^5optotic cells. Figures 3 1 A and 3 IB. 

EXAMPLE 14 

Sequences can be obtained in both deposits using T3 or T7 primers (either 
5 one or both can be used, they are at different sites of the actual insert). Both are 
commercially available from Clontech (#1227 and #1228) and sequence is shown 
below 

T7 primer: 5'(TAATACGACTCACTATAGGGAGA)3' (SEQ ID NO.: 25) 
T3 primer: 5'(ATTAACCCTCACTAAAGGGA)3' (SEQ ID NO.: 26) 
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CLAIMS 

An isolated protein selected from the group consisting of: 

a) a protein comprising the amino acid sequence as shown in 
SEQIDNo.2, 

b) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 4, 

c) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 6, 

d) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 8, 

e) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 10. 

f) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 12, 

g) a protein compmrising the amino acid sequence as shown in 
SEQ ID No. 14, 

h) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 16, 

i) a protein comprising an amino acid sequence which is at least 
40% identical to the amino acid sequence of SEQ ID No. 2, 4, 
6, 8, 10, 12, 14, or 16, and 

j) a protein comprising an amino acid sequence encoded by the 
nucleic acid sequence as shown in SEQ ID No*s 1, 3, 5, 7, 9, 
11, 13 or 15. 

An isolated nucleic acid selected from the group consisting of; 
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a) a nucleic acid comprising the sequence of nucleotides shown 
in SEQIDNo. 1, 

b) a nucleic acid comprising the sequence of nucleotides shoAvn 
in SEQ ED No. 3, 

c) a nucleic acid comprising the sequence of nucleotides shown 
in SEQ ID No. 7, 

d) a nucleic acid comprising the sequence of nucleotides shown 
in SEQ ID No. 9, 

e) a nucleic acid comprising the sequence of nucleotides shown 
in SEQ ID No. 11, 

f) a nucleic acid comprising the sequence of nucleotides shown 
in SEQIDNo. 13, 

g) a nucleic acid comprising the sequence of nucleotides shown 
in SEQ ID No, 15, 

h) a nucleic acid comprising the complement of the sequence of 
nucleotides shown in SEQ ID Nos. 1, 3, 5, 7, 9, 1 1, 13, or 15. 

i) a nucleic acid capable of hybridizing to a nucleic acid 
according to (h) under conditions of low stringency, 

j) a nucleic acid which encodes an amino acid sequence which is 
at least 40% identical to the amino acid sequences of SEQ ED 
Nos. 2, 42 6, 8, 10, 12, 14 or 16, and 

k) a nucleic acid which is at least 40% identical to the nucleic 
acid sequences of SEQ ID Nos. 1, 3, 5, 7, 9, 11, 13 or 15. 

An expression vector comprising a nucleic acid selected from the 
group consisting of: 

a) a nucleic acid comprising the sequence of nucleotides 

shown in SEQ ID No. 1, 3, 5, 7, 9, 1 1, 13, or 15, 
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b) a nucleic acid comprising the complement of the 
sequence of nucleotides shown in SEQ ID No. 1, 3, 5, 
7,9, 11, 13, or 15, 

c) a nucleic acid capable of hybridizing to a nucleic acid 
according to (b) under conditions of low stringency, 

d) a nucleic acid which encodes an amino acid sequence 
which is at least 40% identical to an amino acid 
sequence of SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16, 
and 

e) a nucleic acid which encodes an amino acid sequence 
of SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16. 

The expression vector of claim 3, comprising DNA encoding a 
reporter gene positioned in said vector such that expression of said 
nucleic acid results in expression of said reporter gene. 

The expression vector of claim 4, wherein said reporter gene encodes 
green fluorescent protein. 

A mammalian cell-line transfected with a nucleic acid selected from 
the group consisting of: 

a) a nucleic acid comprising the sequence of nucleotides 
shown in SEQ ID No. 1, 3, 5, 7, 9, 11, 13, or 15, 

b) a nucleic acid comprising the complement of the 
sequence of nucleotides shown in SEQ ED No. 1, 3, 5, 
7, 9, 11, 13, or 15, 

c) a nucleic acid capable of hybridizing to a nucleic acid 
according to (b) under conditions of low stringency 
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d) a nucleic acid which encodes an amino acid sequence 

which is at least 40% identical to an amino acid 
sequence of SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16, 
and 

5 e) a nucleic acid which encodes an amino acid sequence 

of SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16. 



7. A mammalian cell-line as claimed in claim 6, which is transfected 
with an expression vector comprising said nucleic acid and a reporter 
gene, said reporter gene being positioned in said vector such that 

10 expression of said nucleic acid results in expression of said reporter 

gene. 

8. A mammalian cell-line as claimed in claim 7, wherein said reporter 
gene encodes green fluorescent protein. 

9. A manunalian cell-line as claimed in claim 6, wherein said cell-line is 
15 selected from the group consisting of: a fibroblast cell-line, and an 

epitheUal cell-line. 

10. A mammalian cell-line as claimed in claim 6, wherein said cell-line is 
selected from the group consisting of: COSl, BHK21, L929, CVl, 
SWISS 3T3, HT144, IMR32, HEPG2, MDCK, MCF7, 293, Hela, 

20 A549, S W48 and G36 1 . 



11. 



A mammalian cell-line as claimed in claim 10, which is an MCF7 
cell-Une. 



PCT/US99/01361 



-72- 

A mammalian cell-line as claimed in claim 7, which is an MCF7 cell- 
line. 

A mammalian cell-line as claimed in claim 6, wherein said cell-line is 
a primary cell-line. 

A mammalian cell-line as claimed in claim 13 wherein said cell-line 
is selected from the group consisting of: human dermal FIBs, dermal 
keratinocytes, leucocytes, monocytes, and macrophages. 

A non-human transgenic animal comprising a gene encoding a 
protein, the protein selected from the group consisting of: 

a) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 2, 

b) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 4, 

c) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 6, 

d) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 8, 

e) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 10, 

f) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 12, 

g) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 14, 

h) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 16, 
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i) a protein comprising an amino acid sequence which is at least 
40% identical to the amino acid sequence of SEQ ID No, 2, 4, 
6, 8,10,12,14, or 16, and 

j) a protein comprising an amino acid sequence encoded by the 
nucleic acid sequence as shown in SEQ ID No's 1, 3, 5, 7, 9, 
11, 13 or 15. 

A transgenic nematode worm which is lacking all or substantially all 
of the function of its native CED-6 gene which has been transfected or 
transformed with a nucleic acid selected from the group consisting of: 

a) a nucleic acid comprising the sequence of nucleotides shown 
in SEQ ID No. 7, 9, 1 1, 13, or 15, 

b) a nucleic acid comprising the complement of the sequence of 
nucleotides shown in SEQ ID Nos. 7, 9, 1 1, 13, or 15. 

c) a nucleic acid capable of hybridizing to a nucleic acid 
according to (b) under conditions of low stringency, 

d) a nucleic acid encoding an amino acid sequence which is at 
least 40% identical to the amino acid sequences of SEQ ID 
Nos. 8,10,12,14, or 16, and 

e) a nucleic acid which encodes an amino acid sequence of SEQ 
ID Nos. 8,10,12, 14, or 16. 

The transgenic nematode worm of claim 16 which is C. elegans. 

A method for determining whether a compound is an inhibitor or an 
enhancer of a signal transduction pathway which promotes 
phagocytosis of apoptotic cells which method comprises exposing a 
transgenic mammalian cell transfected with a nucleic acid selected 
from the group consisting of: 
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a) a nucleic acid comprising the sequence of nucleotides shown 
in SEQ ID No. 1, 3, 5, 7, 9, 1 1, 13, or 15, 

b) a nucleic acid comprising the complement of the sequence of 
nucleotides shown in SEQ ID No. 1, 3, 5, 7, 9, 11, 13, or 15, 

c) a nucleic acid capable of hybridizing to a nucleic acid 
according to (b) under conditions of low stringency, 

d) a nucleic acid which and amino acid sequence which is at least 
40% identical to the amino acid sequences of SEQ ED Nos, 2, 
4, 6. 8,10, 12,14, or 16, and 

e) a nucleic acid which encodes an amino acid sequence of SEQ 
ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16. 

to apoptotic particles and measuring the rate of phagocytic uptake of 
said particles by said transgenic cells in the presence and absence of 
said compound, wherein an increased rate of phagocytosis indicates 
an enhancer and a decreased rate of phagocytosis indicates an 
inhibitor. 

A method as claimed in claim 18, wherein said apoptotic particles are 
selected from the group consisting of: opsonized apoptotic 
neutrophils, opsonized apoptotic lymphocytes, opsonized apoptotic 
erythrocytes, opsonized killed bacteria and opsonized killed yeast. 

A method as claimed in claim 19, wherein said apoptotic particles are 
labelled. 

A method as claimed in claim 20, wherein said label is selected fix)m 
the group consisting of: a non- fluorescent dye, a fluorescent dye, a 
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non-fluorescent dye linked to an antibody and a fluorescent dye linked 
to an antibody. 

22. A method as claimed in claim 1 8, wherein the transgenic mammalian 
cell is a fibroblast cell or an epitheUal cell. 



5 23. A method as claimed in claim 22, wherein the transgenic mammalian 

cell is selected from the group consisting of: COSl, BHK21, L929, 
CVl, SWISS 3T3, HT144, IMR32, HEPG2, MDCK, MCF7, 293, 
Hela, A549, SW48 and G361. 

24. A method as claimed in claim 1 8, wherein said transgenic manraialian 
1 0 cell is a primary cell. 



25. A method as claimed in claim 24, wherein said transgenic mammalian 
cell is selected from the group consisting of: human dermal FTBs, 
dermal keratinocytes, leucocytes, monocytes and macrophages. 



26, A method as claimed in claim 2 1 , wherein the phagocytosed apoptotic 
particles are detected by a method selected from the group consisting 
of: light microscopy, fluorescence microscopy, quantitative 
spectrofluormetry, and flow cytometry. 



20 



27. 



A compound identified by the method of claim 18, as an inhibitor or 
an enhancer of a signal transduction pathway which promotes 
phagocytosis of apoptotic cells. 
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A method for determining whether a compound is an inhibitor or an 
enhancer of a signal transduction pathway which promotes 
phagocytosis of apoptotic cells which comprises: 

(1) introducing into a mammalian cell a protein selected from the 
group consisting of : 

a) a protein comprising an amino acid sequence as shown 
in SEQ ID No. 2, 4, 6, 8, 10, 12, 14, or 16, 

b) a protein comprising an amino acid sequence which is 
at least 40% identical with SEQ ID Nos. 2, 4, 6, 8, 10, 
12, 14 or 16 

c) a protein comprising an amino acid sequence encoded 
by the sequence of nucleotides shown in SEQ ID Nos. 
1,3, 5, 7, 9, 11, 13, or 15; or 

introducing into a mammalian cell a vector expressing RNA 
antisense that inhibits transcription of a protein in one of the 
groups a) to c) above, and 

(2) exposing the mammalian cell to apoptotic particles and 
measuring the rate of phagocytic uptake of said particles by 
said cell in the presence or absence of said compound wherein 
an increased rate of phagocytosis indicates an enhancer and a 
decreased rate of phagocytosis indicates an inhibitor. 

A method as claimed in claim 28, wherein said apoptotic particles arc 
selected from the group consisting of: opsonized apoptotic 
neutrophils, opsonized apoptotic lymphocytes, opsonized apoptotic 
erythrocytes, opsonized killed bacteria, and opsonized killed yeast. 
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30. A method as claimed in claim 28, wherein said apoptotic particles are 
labelled. 

31. A method as claimed in claim 30, wherein said label is selected from 
the group consisting of: a non-fluorescent dye, a fluorescent dye, a 

5 non-fluorescent dye linked to an antibody, and a fluorescent dye 

linked to an antibody. 

32. A method as claimed in claim 28, wherein the mammalian cell is a 
fibroblast cell or an epithelial cell. 

A method as claimed in claim 32, wherein the mammalian cell is 
selected from the group consisting of: COSl, BHK21, L929. CVl, 
SWISS 3T3, HT144, IMR32, HEPG2, MDCK, MCF7, 293, Hela, 
A549, SW48andG361. 

A method as claimed in claim 28, wherein said mammalian cell is a 
primary cell. 

A method as claimed in claim 34, wherein said mammalian cell is 
selected from the group consisting of: human dennal FIBs, dermal 
keratinocytes, leucocytes, monocytes, and macrophages, 

A method as claimed in claim 30, wherein the phagocytosed apoptotic 
particles are detected by a method selected from the group consisting 
of: light microscopy, fluorescence microscopy, quantitative 
spectrofluonnetry and flow cytometry. 



33, 

10 

34, 

15 35. 

36. 
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A compound identified by the method of claim 28 as an inhibitor or 
an enhancer of a signal transduction pathway which promotes 
phagocytosis of apoptotic cells. 

A method for determining whether a compound is an inhibitor or an 
enhancer of a signal transduction pathway which promotes 
phagocytosis of apoptotic cells which method comprises exposing a 
mammalian cell selected from the group consisting of: 

(1) a transgenic mammalian cell transfected with a nucleic acid 
selected comprising a nucleic acid sequence shown in SEQ ID 
No. 1, 3, 5, 7, 9, 11, 13, or 15, 

(2) a mammalian cell which expresses a protein selected from the 
group consisting of: 

a) a protein comprising an amino acid sequence as shown 
in SEQ ID No. 2, 4, 6, 8, 10, 12, 14, or 16, 

b) a protein comprising an amino acid sequence which is 
at least 40% identical with SEQ ID Nos. 2, 4, 6, 8, 10, 
12, 14, or 16; and 

c) a protein comprising an amino acid sequence encoded 
by the sequence of nucleotides shown in SEQ ID Nos. 
1,3.5, 7, 9, 11, 13, or 15, and 

(3) a mammalian cell which comprises a vector expressing RNA 
antisense to a protein selected from groups consisting of: 

a) a protein comprising an amino acid sequence as shown 
in SEQ ID No. 2, 4, 6, 8, 10, 12, or 16, 

b) a protein comprising an amino acid sequence which is 
at least 40% identical with SEQ ID Nos. 2, 4, 6, 8, 10, 
12, 14, or 16; and 
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c) a protein comprising an amino acid sequence encoded 
by the sequence of nucleotides shown in SEQ ID Nos. 
1,3,5,7, 9, 11, 13, oris, and 

to a compound to be tested and determining whether there is any 
change in the organization of the actin cytoskeleton, wherein an 
increase in the rearrangment of actin cytoskeleton indicates the 
enhancer, and a decrease in the rearragnement of actin cytoskeleton 
indicates the inhibitor. 

A method as claimed in claim 38, wherein the actin cytoskeleton is 
visualized with a fluorescent dye which is linked to a compound 
which interacts with F- actin. 

A method as claimed in claim 39, tv^herein said linker compound is 
phalloyidine. 

A method as claimed in claim 38, wherein the transgenic mammalian 
cell is a fibroblast cell or an epithelial cell. 

A method as claimed in claim 41, wherein the transgenic mammalian 
cell is selected firom the group consisting of: COSl, BHK2I, L929, 
CVl, SWISS 3T3, HT144, IMR32, HEPG2, MDCK, MCF7, U293, 
Hela, A549, SW48, and G361. 

A method as claimed in claim 38, wherein said transgenic mammalian 
cell is a primary cell. 
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A method as claimed in claim 43, wherein said transgenic mammalian 
cell is selected from the group consisting of: human dermal FEBs, 
dermal keratinocytes, leucocytes, monocytes, and macrophages. 

A compound identified by the method of claim 38, as an inhibitor or 
an enhancer of a signal transduction pathway which promotes 
phagocytosis of apoptotic cells. 

A method for determining whether a compound is an inhibitor or an 
enhancer of expression of a gene encoding a protein which 
participates in a signal transduction path way which promotes 
phagocytosis of apoptotic cells which method comprises: 

(1) exposing a transgenic mammalian cell as claimed in claim 7 to 
said compound, 

(2) measuring the level of expression of said reporter gene, and 

(3) comparing said expression with the level of expression of said 
reporter gene in the absence of said compound, wherein an 
increased rate of phagocytosis indicates an enhancer and a 
decreased rate of phagocytosis indicates an inhibitor. 

A compound identified by the method of claim 45, as an inhibitor or 
an enhancer of expression of a gene encoding a protein which 
participates in a signal transduction pathway which promotes 
phagocytosis of apoptotic cells. 

An antibody directed against an epitope of the protein selected from 
the group consisting of: 

a) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 2, 
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b) a protein comprising the amino acid sequence as shown in 
SEQ ID No, 4, 

c) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 6, 

d) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 8, 

e) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 10, 

f) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 12, 

g) a protein comprising the amino acid sequence as shown in 
^ SEQ ID No. 14, 

h) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 16, 

i) a protein comprising an amino acid sequence which is at least 
40% identical to the amino acid sequence of SEQ ID No. 2, 4, 
6, 8, 10, 12, 14, or 16 and 

j) a protein comprising an amino acid sequence encoded by the 
nucleic acid sequence as shown in SEQ ED No: 1, 3, 5, 7, 9, 
11, 13 or 15. 

An antibody as claimed in claim 48, which is a monoclonal antibody. 

A method of treating in an individual having a disease selected from 
the group consisting of: inflammation, autoimmune disease and 
cancer comprising administering to a patient a medicament 
comprising an effective amount of a protein selected from the group 
consisting of: 
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a) a protein comprising the amino acid sequence of SEQ ID No: 
8, 

b) a protein comprising an amino acid sequence which is at least 
40% identical to the amino acid sequence of SEQ ID No. 8 
and 

c) a protein comprising an amino acid sequence encoded by the 
nucleotide sequence of SEQ ID No, 7. 

A method of treating an individual having a disease selected from the 
group consisting of: inflammation, autoimmune disease, and cancer 
comprising administering to a patient an effective amount of a 
compound which is an enhancer of a signal transduction pathway 
which promotes phagocytosis of apoptotic cells. 

A method of treating an individual having a disease selected from the 
group consisting of: inflammation, autoimmune disease and cancer 
comprising administering to a patient an effective amount of a 
compound which is an enhancer of expression of a gene encoding a 
protein which participates in a signal transduction pathway which 
promotes phagocytosis of apoptotic cells. 

A method of treating an individual having a disease selected from the 
group consisting of: inflammation, autoimmune disease and cancer 
comprising administering to a patient an effective amount of a nucleic 
acid selected from the group consisting of: SEQ ID No. 7, 13, and 15. 

A method of treating an individual having a disease selected from the 
group consisting of: neurodegenerative disease, stroke, and sickle-cell 



PCTAJS99/01361 



-83- 

anaemia comprising administering to a patient an effective amount of 
a protein selected from the group consisting of: 

a) a protein comprising the sequence of amino acids of SEQ ID 
No.: 8, 14, 16 , 

b) a protein comprising the sequence of amino acids which is at 
lest 40% identical to the amino acid sequence of SEQ ID No.: 
8, 14, or 16, and 

c) a protein comprising an amino acid sequence encoded by the 
nucleotide sequence of SEQ ID No.: 7, 13, or 15. 

A method of treating an individual having a disease selected from the 
group consisting of: neurodegenerative disease, stroke and sickle cell 
anaemia comprising administering to a patient an effective amount of 
a compound identified as an inhibitor of a signal transduction pathway 
which promotes phagocytosis of apoptotic cells. 

A method of treating an individual having a disease selected from a 
group consisting of: neurodegenerative disease, stroke and sickle cell 
anaemia comprising administering to a patient an effective amount of 
a compound identified as an inhibitor of expression of a gene 
encoding a protein which participates in a signal transduction pathway 
which promotes phagocytosis of apoptotic cells. 

A method of treating an individual having a disease selected from the 
group consisting of: neurodegenerative disease, stroke, and sickle-cell 
anaemia comprising administering to a patient an effective amount of 
a nucleic acid selected from the group consisting of: 

a) a nucleic acid comprising the sequence of nucleotides of SEQ 
ID No. 7, 13, or 15, 
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b) a nucleic acid which hybridizes to the sequence of nucleotides 
of SEQ ED No. 7, 13, or 15, and 

c) a nucleic acid that encodes SEQ ID NO.: 8, 14, or 16. 

A phajrmaceutical composition comprising a protein selected from the 
group consisting of: 

a) a protein comprising the amino acid sequence of SEQ ID No. 
8, 14, or 16, 

b) a protein comprising an amino acid sequence which is at least 
40% identical to the amino acid sequence of SEQ ID No. 8, 
14, or 16, and 

c) a protein comprising an amino acid sequence encoded by the 
nucleotide sequence of SEQ ID No, 7, 13, or 15, 

and a pharmaceutically acceptable carrier. 

A pharmaceutical composition comprising a compound as claimed in 
claim 27 and a pharmaceutically acceptable carrier. 

A pharmaceutical composition comprising a compound as claimed in 
claim 37 and a pharmaceutically acceptable carrier. 

A pharmaceutical composition comprising a compound as claimed in 
claim 45 and a pharmaceutically acceptable carrier. 

A pharmaceutical composition comprising a compound as claimed in 
claim 47 and a pharmaceutically acceptable carrier. 
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A pharmaceutical composition comprising a nucleic acid as selected 
from the group consisting of: 

a) a nucleic acid comprising the sequence of nucleotides of SEQ 
ID No. 7, 13, or 15, 

b) a nucleic acid which hybridizes to the sequence of nucleotides 
of SEQ ID No. 7, 13, or 15, and 

c) a nucleic acid that encodes SEQ ID NO.: 8, 14, or 16. 



A method for identifying proteins which interact with the proteins of 
claun 1 , in a signal transduction pathway which promotes engulfiment 
of apoptotic cells comprising the steps of: 

(a) providing a host cell having a DNA construct comprising a 
reporter gene under the control of a promoter regulated by a 
transcription factor, which factor has a DNA binding domain 
and an activating domain, 

(b) expressing in said host cell a first hybrid DNA sequence 
encoding a first fiision of a fragment or all of a nucleic acid of 
claim 2 and either said DNA binding domain or said activating 
domain of said transcription factor, 

(c) expressing in said host cell at least one second hybrid DNA 
sequence encoding a pulative interacting protein together with 
the DNA binding domain or activating domain of the 
transcription factor which is not incorporated in the first 
fusion, and 

(d) determining any binding of the protein being investigated with 
a protein according to any of claims 1, 10 or 16 by detecting 
any production of the reporter gene product in said host cell. 
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An isolated protein from the nematode worm C elegans which 
comprises an amino acid sequence of from about amino acid residue 
242 to about amino acid residue 338 in Figure 2A or an amino acid 
sequence which differs from that of said protein only in conservative 
amino acid changes. 

An isolated protein which comprises an amino acid sequence which is 
from about amino acid 1 1 to about amino acid 190 in Figure 20 or an 
amino acid sequence which differs from that of said protein only in 
conservative amino acid changes. 

A method of diagnosis of a disorder in a patient which is associated 
with a defect of phagocytosis of apoptotic cells which comprises 
exposing a nucleic acid selected from the group consisting of; 

a) a nucleic acid comprising the sequence of nucleotides shown 
inSEQEDNo. 1,3,5,7, 9, 11, 13, or 15, 

b) a nucleic acid comprising the complement of the sequence of 
nucleotides shown in SEQ ID Nos. 1, 3, 5, 7, 9, 1 1, 13, or 15, 

c) a nucleic acid capable of hybridizing to a nucleic acid 
according to (b) under conditions of low stringency 

d) a nucleic acid which encodes an amino acid sequence which is 
at least 40% identical to the amino acid sequences of SEQ ED 
Nos. 2, 4, 6, 8, 10, 12, 14, or 16, 

e) a nucleic acid which encodes an amino acid sequence of SEQ 
ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16. 

to a nucleic acid sample for the patient and detecting hybridization. 
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A method of diagnosis of a disorder in a patient which is associated 
with a defect of phagocytosis of apoptotic cells which comprises 
detecting a protein selected from the group consisting of: 

a) a protein a comprising the amino acid sequence of SEQ ID 
No. 8, 

b) a protein comprising the amino acid sequence of SEQ E) No. 
14, 

c) a protein comprising the amino acid sequence of SEQ ID No. 
16, 

d) a protein sequence having an amino acid sequence which is at 
least 40% identical to the amino acid sequence of SEQ ID 
Nos. 8, 14 or 16 or a protein sequence encoded by the nucleic 
acid sequence of SEQ ID Nos. 7, 13, or 15. 

in a sample from a patient with an antibody to an epitope of one of the 
aforesaid proteins. 



A protein which comprises a protein selected from the group 
consisting of; 

a) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 2, 4, 6, 8, 10, 12, 14, or 16, 

b) a protein comprising an amino acid sequence which is at least 
40% identical to the amino acid sequence of SEQ ID No. 2, 4, 
6, 8, 10, 12, 14, or 16, 

c) a protein comprising an amino acid sequence encoded by the 
nucleic acid sequence as shown in SEQ ED No's 1, 3, 5, 7, 9, 
11, 13, or 15, 

wherein said protein is fused to another protein. 



PCTAJS99/0136t 



-88- 

A protein as claimed in claim 69, wherein said other protein is an 
epitope tag or the product of a reporter gene. 

A method for identifying whether a compound is an enhancer of a 
signal transduction pathway which promotes phagocytosis of 
apoptotic cells which comprises the steps of: 

(1) exposing a C. elegans, in which the expression of CED-6 is 
defective or otherwise suppressed, to a compoimd to be tested 
and 

(2) scoring for return to wild-type phenotype. 

A method for determining whether a compound is an enhancer or 
inhibitor of a signal transduction pathway which promotes 
phagocytosis of apoptotic cells which comprises the steps of: 

(1) exposing a transgenic C elegans as claimed in claim 17 to the 
compound to be tested, and 

(2) measuring the level of phagocytic activity by scoring apoptotic 
corpses in the heads of LI larvae and/or the gonads. 

An isolated protein which is an adaptor molecule in a signal 
transduction pathway which regulates phagocytosis of apoptotic cells. 

Use of a protein or nucleic acid for use in therapy, e.g., inflammation, 
autoimmune disease or cancer, comprising: a protein comprising an 
amino acid sequence of SEQ ID NO.: 2, 4, 6, 8, 10, 12, 14, or 16, a 
nucleic acid that encodes SEQ ID NO.: 2, 4, 6, 8, 10, 12, 14, or 16, an 
amino acid sequence encoded by SEQ ID NO.: 1, 3, 5, 7, 9, 1 1, 13, or 
15, or a nucleic acid sequence of SEQ ED NO.: 1, 3, 5, 7, 9, 1 1, 13, or 
15. 
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Use of a protein or nucleic acic for use in therapy, e.g., 
neurodegenerative disease, stroke or sickle cell anemia, comprising: a 
protein comprising an amino acid sequence of SEQ ID NO.: 2, 4, 6, 8, 
10, 12, 14, or 16, a nucleic acid that encodes SEQ ID NO.: 2, 4, 6, 8, 
10, 12, 14, or 16, an amino acid sequence encoded by SEQ ID NO.: 1, 
3, 5, 7, 9, 11, 13, or 15, or a nucleic acid sequence of SEQ ID NO,: 1, 
3,5, 7, 9, 11, 13, or 15. 
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FIG, 1 A- 



FIG. 1 e. 



FIG. 1 



FIG. 1 D. 
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Overexpression of cedS Rescues 
the Engulf ment Defect of ced-G Mutant 
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Overexpression of ced-6 Rescues the Engulfment Defect of 
ced-6 Mutant During Embryonic Development 
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Can cec/-6 also promote the engulf ment 
of Persisting corpses? 
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Overexpression of ced-S Rescues the Engulfment Defect 
in the Adult Gonad, and Might Act in Somatic Sheath Ceils 
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GGTGATGAGCCCTTGGGTTCTCGCTCCGACTGCTAAATTCGCTTGGCCGGGTCCACCTTCT 

CGTGGCCTCACTCGCCACACGGATCAGAATCCGGAGCAGGCAGTTCTCTCTATTCTGAGGC 

TCCTGCGGCTGCCGCGCTGACTTCCCTGTGTGGNGGAGGGAACTCTGGGCAGGCTGGTTTT 

CTTGGAATGTGTTTACGATGTTGAATGGGACTTGAACAGGAAGCTGGACGCTGCAGCTGG 

AACTAGCGTGCCAAGTTATTTATGATTCCATCTGATATACATAGGAGAGAAACTGATAGA 

AGAATTCTGATGGCAACTGTATGATAGAAGCTATATAAAGTCAAGTGTCCATTTTCTTTCA 

ACTATATTTGAGCAT^CCCAGGATTTAAGTCGTGGAACTGAACATTTATTTGGC^^ 

CAT CATG AACCGTGCi'riTAGCAGGAAGAAAGACAAAACATGGATGCATACACCTGAAGC 

TTTATCA.\AACATTTCATTCCCTATAATGCAAAGTTTCITGGCAG 

CCAAAAGGAACAGA.\GTTGTGAGAGATGCTGTAAGGAAACTAAAGTTTGCAAGACATAT 

CAAGAA.\TCTGAAGGCCAGAAAATTCCTAAAGTGGAGTTGCAAATATCAATTTATGGAGT 

A.AAAATTCTAGA,\CCCA.AA.ACAAAGG AAGTrCAACACA.\TrGCCAGCTTCATAGAATATC 

n illGTGCAGATGATA.A,\.^CTGACAAGAGGATATTCACTTTCATATGCAA.\GATTCTGAG 

TCAA.^TAAACATTTGTGCTATGTArrTGACAGCGAAAAGTGTG GTG.'^J^^ 

CAATrGGCCAAGCATTTGACCTGGCATACACGA.AATTTCTAGAATCAGGAGGAA.\AGATG 

TrGAAACAAGAAAACAGATCGCAGGGTTACAAAAAAGAATCCAAGACTTAGAAACAGAA 

AATATGGAACTTAA.\.\-\TAAAGTACAAGATTTGGAAAACCA.ACTGAGAATAACTCAAGTA 

TCAGCACCTCCAGCAGGCAGTATGACACCTAAGTCGCCCTCCACTGACATCTTTGATATGA 

TTCCATTTTCTCCAATATCACACCAGTCTTCGATGCCTACTCGCAATGGCACACAGCCACC 

TCCAGTACCTAGTAGATCTACTGAGATTAAACGGGACCTGTTTGGAGCAGAACCrTTTGAC 

CCATTTAACTGTGGAGCAGCAGATTTCCCTCCAGATATTC,AATCAAAATTAGATGAGATGC 

AGGAGGGGTTCA.AAATGGGACTAACTCTTGAAGGCACAGTATTTTGTCrCGACCCGTTAG 

ACAGTAGGTGCTGACATCAAGAACAAGAAATCCTGATTCATGTTA.'^ATGTGTTTGTATAC 

ACATGTCATTTATTATTATTACTTTAAGATAGGTATTATTCATGTGTCAATGTIT^ 

TITTAATATTTTGAA.\.\TTTTCTCAGTTAAATTTCCT^ 

TATTTTAAAAACAGCTTACTGTAAAGTAGATCATACl'll'iATGTTCCTTTCTGTTTCT 

TAGATGAATTTGTA-=\TTG.-^\AGACATATTATACAAATACCTGCCTTGTGTCTGAGTTCTAT 

TTAGTTAGCATCTTGA\ATTTGTATTCATTTTCCAGATGGCTAGTTTA 

AAAGCCATACCTTAAAGATAACTTTTTAAATTCTGAAGAGACATGCCAATGTCA-'5u'\CTA.^ 

ACATGTTCTGTll"il.\.\ACCAACAAACATGTTACTATTCATTGGACAGATATCATTTTA 

ATAAATACTGTTCACATCACTGGGAAAATGTAAACTTTA,AACATAATGCCACAAGGTCAC 

TAATTTCTAGCAGGTAAAATTATAAGGATATAAATTCCAATAATAA-^CCAAATGTATTTAG 

AGTATTTATTAGT.A.A^TGCAAGGTGATGTTAGTTATGATCAGTTATACTCTAAATATTT/^^ 

TTTGTTTTATA,AAGGTAGTGAAAAAATGAA.A/\TrTGCTATTTAT^ 

CATTCCAAATGAGATA.\GTGATATTACTATAACATCTAAGCATCATCTGATTTGATATTCC 
CTA.«WVAACATTTGG.\ATATATGCTATCTATAGATTCAGTATCTACTACCCATATT^ 
ACCAAATATATTTCTCCTCACTGCATAAGGACTACTCTTCTCATA'l l I ICITCTTTGATGAA 
GATAl I I 1 ICACCAAAGTTTATTTTGTGATGCCCTCTTGGTTrrGATACTTTAAA-^TCTGTG 
GCACCCGTTCTACATGAATTATCAATATTrGGTAAATrCAATCTGTATTTGl'i f'l GTTAAAG 
TCAAAAATCTCATTTrCCAAAAA-A.\AAAAAAAAAAACTCGAG 
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GGTGATGAGCCCTrGGGTTCTCGCTCCGACTGCTAAATTCGCTTGGCCGGGTCCACCTTCT 
CGTGGCCTCACTCGCCACACGGATCAGAATCCGGAGCAGGCAGTTCTCTCTATTCTGAGGC 
TCCTGCGGCTGCCGCGCTGACTTCCCTGTGTGGNGGAGGGAACTCTGGGCAGGCTGGTTTT 
CTTGGAATGTGTTTaCGATGTTGAATGGGACTTGAACAGGAAGCTGGACGCTGCAGCTGG 

aactagcgtgccaagrratttatgattccatctgatatacataggagagaaactgataga 
agaattctgatggca.\ctgtatgatagaagctatataaagtcaagtgtccattttctttca 
actatatttgagca taccc aggatttaagtcgtggaactgaacatttatttggctgatcct 
catc atg aaccgtgcttttagcaggaagaaagacaaaacatggatgcatacacrtgaartr 

tttatcaaa.acatttcattccctataatgcaaagtttcttggcagtacagaagtggaaca 

CCAAAAGGAACAGA-AGTTGTGAGA.GATGCTGTAAGGAAACTAAAGTTTGCAAGACATAT 

CAAGAAATCTGA.AGGCCAGA*\AATTCCTAAAGTGGAGTrGCAAATATCAATTTATGGAGT 

A.AAAATTCTAGA.\CCCAA.\ACAAAGGCTGAAGAGATCACTTTAACAATTGGCCA.^ 

TGACCTGGCATACACGAAATTTCTAGAATCAGGAGGAAAAGATGTTGAAACAAGAAAAC 

AGATCGCAGGGTTACAAAAAAGAATCCAAGACTTAGAAACAGAAAATATGGAACTTAAA 

AATAAAGTACA.AGATTTGGA-AAACCAACTGAGA.ATAACTCAAGTATCAGCACCTCCAGCA 

GGCAGTATGACACCT.AAGTCGCCCTCCACTGACATCTTTGATATGATTCCATTTTCTCCAA 

TATCACACCAGTCTTCGATGCCTACTCGCAATGGCACACAGCCACCTCCAGTACCTAGTAG 

ATCTACrGAGATT,A.'\-\CGGGACCTGTTTGGAGCAGAACCTriTGACCCATTTAACTGTGGA 

GCAGCAGATTTCCCTCCAGATATTCAATCAAAATTAGATGAGATGCAGGAGGGGrrCAAA 

ATGGGACTA.ACTCTTGA.AGGCACAGTATTTTGTCTCGACCCGTTAGACAGTAGGTG CTGA 

CATCAAGAACA.AGAA.\TCCTGATTCATGTTAAATGTGTTTGTATACACATGTCATTTATTA 

TTATTACTTTAAGATAGGTAT^ 

AATTTTCTCAGTTA.VATTTCCTCACCTTCACTATTGATCTGTAA'il'rr 

CTTACTGTAAAGTAGATCATACl'T "1 ' 1 ATGTTCCTTTCTGTTTCTACTGTAGATGA.ATTTGTA 
ATTGAAAGACATATTATACAAATACCTGCCTTGTGTCTGAGTTCTATTTAGTTAGCATCT^ 
GA.AATTTGIXTTCATTTTCCAGATGGCTAGTT^ 

AAGATAACTrTTTAAATTCTGAAGAGACATGCCAATGTCAAACTAAACATGTTCTGTTT^ 

AAACCAACAAACATGTTACTATTCATTGGACAGATATCATTTTATGTATAAATACTGTTCA 

CATCACTGGGAAA-^TGTAAACTTTAAACATAATGCCACAAGGTCACTAATTTCTAGCAGG 

TAAAATTATAAGGATAT.A.A.ATrCCAATAATAAACCAAATGTATTTAGAGTATTTATTAGT^ 

AATGCAAGGTGATGTTAGTTATGATCAGTTATACTCTAAATATTTAATTTGTTT^ 

GTAGTGAA-A.AAATG.A.\,A.ATrrGCTATTTATTAAAAAACATTAAATTTCA 

AT.AAGTGATATrACTATAACATCTAAGCATCATCTGATTTGATATTCCCTAAAAAACATTT 

GGAATATATGCTATCTATAGATTCAGTATCTACTACCCATATTTACTTTACCAAATATAT^ 

CTCCTCAC TGCA TAAGGACTACTCTTCTCATATTTTCTTCI^ 

AAAGTTTATTTTGTGaTGCCCTCTTGGTTTTGATACTTTAA 

ATGAATTATCAATATTTGGTAAATTCAATCTGTATTTGTTTTGTTAAAGTCAAAA.^TC 
rTTCCAAAAA.A.-\A.A.\.JLAAAAAAACTCGAG 
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NINRAFSRKKDKT%V-NIHT?EALSKHFIPYNAKFLGSTEVEQPKGTEVA^A^^^ 

pn^PKrAPPITLTIGOAFDL\YTKFLESGGKDVETRiCQ(AGLQKJlIQDLETENMELK>.^ 
M^5v?APPAGS^r^PK^^^^ 

p^FFNCGAABFPPDIQSKLDENtQEGFK-MGLTLEGTV'FCLDPLDSIU:*- 



M^^lAJFSRJCKDKTW^^HTPEALSKHFIP 

EGQKIPKV1ELQISrYGVKILEPKTKAEEITLTIGQAFDLAYTXFLESGGKX»VET^ 

DLETENMELK>nK:VQDLENQLRITQVSAPPAGSMTPKSPSTDIFDMIPFSPISHQSSi\^T^ 

PPVPSRSTEIKiUDLFGAEPFDPFNCGAADFPPDIQSKLDEMQEGFKMGLTLEGT^ 

C* 
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FIG. 22 



CGTCATGAQC CCTTGGCTrC TCGCTCOGAC TGCTAAATTC GCTIGGCCGG GTCCACCTTC TCGTCGCCTC ACTCOCCAO. CGOATCAGAA rtCCGGAGCAC 100 

GO^OTTCrrCT CTATTCI^AG GCTCCTCCC=Q CTGCCGGCT^ i^CTICCCrGT GTGCGQQAGG G^ACTCTGGG O^CMGGTT ^TCrrOGAAT GTQTTTACCA 200 

axmX^AA^ GACnGAACX GGAAGCTGGA CGC3«CAGCr OGAACTi^CG TQCCAAO™ TTTATOArrC CATCICATAT ACATAGGAGA GAAACTGATA 300 

OAAOAATrCT GATGGCAACT CTA^AXAGA AGCTAl^ATA AAGTCAAGTG TCCArTTTCT TTCAACTATA TTTGAGCAXA OCCAGGATTT AAG.«GTGGA 400 

TraRCecTGC TTTEAQCAGG AAGAAAGACA AAACATGGAT GCATACACCT GAAGCTTTAT CAAAACATTT SCO 

ACTGAACATT TATTTGGCTG ATCCTC&'fiC^ TGAACCGTGC TTTTM*^ wvuju^^h^ T W K HTP E >T. S KHP 

«,i^»riiaf-rr GAACAGCCAA AAGGAACAGA AGTTQTQAGA GATCCTGTAA aGAAACTAAA OTTTQCAAGA 600 

CRTTCCCTAT AAaXJCAAAGT TTCTTGGCAG TWMAACTG GAACAGCCAA M^i^i i^^^ yyR pRVft KT -^f fAR 
V p Y HAK^ t.Gg TBV fcufn. xj * 

^ PTB domain 

a.»cy^ lu^^xo..^. cc^^ .xtxco^t >TC>ym^T Txen^g: cy^y:^ 

LItxL CCTIC^T>C^ A^AXCTXTT O.^^^ X^i"-^ ^^"^^ "TT^"' ^ ^"^^ ^^^^ 

«-»K:llQKrC ACTTTAJVCAA TTCGCCAAaO ATTTBRCCIG GCHTRCACQX AATTTCTAGA ATCSCSMGA 900 

Charged regfon * 

0«=«A^ C.CCr,C»CC ^C«^T3 AOCCTy^ CC^C tyCyTCgX. g.X.^ATTC CTa^C 

] , «v^^.m«rr caATGGCACA CAGCCACCTC CAGTACCTAQ TAQATCXACT GAGATTAAAC GGGACCTGTT TO3AGCAGAA 1200 

AATATCACAC CICTCITCGA TCOCTAirCOG C^^^^^ g^ S^T^ V _P R 5 T E T K ^ D L F Q ^ . 

pfoMne/sorine rich region (potential SH3 binding domain) 
ccxxr^ g.^nycra ^y:. a^Trrcc=TC c.^.Tyr:c. g^xg^ATCc c^«go^ c^^c^, ""^ 

TaLwaL aL^AA^AT LLr^A^ CAKOTTCAA TGTTTTIGAA TATTTBUVTA TITTCAAAAt TTTCTCAGIT AAATTTCCrC ACcflCACTA ISOO 

TTGMWSTA AtnrrrATrx taaaaacacc ttacbhaaa gtagatcata cTTrr«««T ocTncrorr TCTACTGTAG A«5AATIT« AATTOAAAGA 16O0 

CATATIATAC AAATACCTGO CIXGWICW AGTrCXATrT ACTTAGCATC TI«AAATTK TATTCATTTT CCAG^TGCCT «^A^ TGATTtOCCA "00 

AA*<«»XAC CTTAAASASA ACtTTTtAAA TTCTC^ ACATOCCAAX OTCAAACTAA ACATGTTCIO «TITAA*CO AACAAACAW .TTA^TATTCA «00 

TWOACASAT ATOTTTEM GXATAAATAC TCTTCACATC ACTGGGAAAA -TSTAAACTTr AAACATAATO CCACAAC^^ ACTAArPTCT A0CAG8XAAA "00 

ATTATAACGA TAXAAATTCC AA^TAAAC CAAATOTAXT TAGAGTATTT ATTA^mAAT GCAA««C«r CrTACWATa ATCAO«ATA CTCTAAATA? 2QO0 

„A«m«W TTAXAAiGGT ACtOAAAAAA TGAAAATTO CTATTTATTA AAAAACATTA AATTICATXC CAAATGAOAT AAOTGA^TT ACTATAACA^ »0C 

CTAACCATCA «:rCATTI«A TATtCCCTAA AAAACATI^ GAATATATOC TATCTATAGA «CAOTA=CT ACIACCCATA TTTACTTTAC CAAATATATT «00 

«««CACT CCATAAfiOAC TACTCWCTC AaATtTTCTT CTTTGATGAA CATATTl^ ACCAAAGTTT ATTTWrOAT GCCCICTTGO TTtlGATACT "00 

TXAAAAKTG 1«5CACCCGT TCTACA^Aa' «ATCAATAT «GGTAAArr CAAK«TAT tTSTmCrr AAAOICAAAA ATCTCATTTT CCAAAAAAAA »00 

2412 

AAAAAAAAAA AC 
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SIM output with, parameters; substitution scores in BL0SUM6a O « 12, E « 4* 



Sequence 1: CEO- 6, 
Sequence 2: hCSO€« 



4 92 residues 
3 04 residues 



List of local alignments with score >« 3S.0 

47,5% identity in 184 residues overlap; Score: 3 8e.O; Gap frequency: 2.7% 



CED-fi, 
hCEDfi. 



4 S RTWIHP PD YLINGKVE YVARFI/3CVETPKAWGSDVAREAIHAIRFQRDLKRSEQTRETAK 
1 1 KTWMHT PEALSKHFI PVUAKFIiGSTEVEQPKGTEWRDAVRKIjKFAJlHI KKSE GQK 



CED-6, 

hCEoe , 



10 5 LQICVEIRlSIDMVriAD I KTKAPMYTFPLGRISFCADDKDDKRMFSFIARAEGASGKPSC 
67 IPKVELQrSIYGVKILEPKTKEVQKNCQLKRXSFCADDKTDKRIFTFICK-DSESMKKI/: 



CED-fi, 
hCED£, 



165 YAFTSEKIJUEI)ITI*TIGEAFDIAVXRFLDKNRTSI*ENQKQIYILraCKIVELETENQVLIE 
126 YVFDSEKCAEEITLTIGQAFDLAVTKFLESGGKDVETRKQIAGLQKRIQDLETENMELKN 



CED-6, 
hCEDG, 



22S RIAE 
18 o KVQD 



31.6% identity in 38 residues overlap; Score: 38.0; Gap frequency: 0.0% 



CED-6, 
hCEDfi . 



265 PNIPPSSIYSMPRANDLPPTEMAPTLPQISTSSNGASP 
221 PFSPISHQSSMPTPNGTQPPPVPSRSTEIKRDLFGAEP 
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FIG. 25(A) 
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FIG. 25(C) 
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Figure 28A . 
hced-6 alignment Formatted Alignment 



a65983/genl»nk. — 

//hcGd'S Ctm /'GGTGATCAGC CCTTOGGrrC TCGGTOOGAC TGCTAAATTC QCTTOGCCGG 50 

Tha PGR fragmeat 

hbc3123 EST clona 

/^S'/Rfi58Q2/gcnbank . : /'QCTAAATTC GCTTOGQOGO 19 

^5*/Aia59394/genban)c /tmOATGAGC CCnQOGnC TCGCTCCGAC TGCTAAATIX: GCTrGGOCGG SO 

Consensus GGTOATGAGC CCrXGGGrrC TCGCTCCGAC TGCTAAATTC GCTIGGCCGG SO 

1165983 /genbajik, • 

^'hced-e CDNA GTCCAOCTTC TaTTGGCXnC ACTCGCCACA aSCSWrGRGAA TCOGGAGCAG 100 

The "PGR fragmant 

Iibc3l23 EST clone : 

^5*/R65882/genbanfc QTCCACCTTC TCGTGGCCTC ACTCGCCACA OQGATCAGAA TCXSSGAQCAG 69 

iyS'/AAl59394/genbaak QTCCACCTTG TCGTGOCCrC ACTCGCCACA OCSGATCAGAA TCOGGAGCAG * 100 

Consensus GTOCRCCTTC ICGTGGCCTC ACTCGCCACA CGGATCftGAA TOOGGAGCAG 100 

^hcBd-fi'^cm^^ GCAGTTCTCT CTATTCTGAG GCTGCTGCGG C-^TC3GCGGC- TGACTTCCC- 147 

The PCR fragment ' 

hbq3ia3 EST clone r 

/>'S*/R65flB2/genbank ' GCAGTTCTCT dATTCTGAG GCTCCTGOGG C-TGOCGGC- TGACITOCC- 116 

// SVAAlS9394/genbank GCA GTl t, avr CTATTCTGAG GCTCCTGOGG CWTGCCMGCG TGACTTCCCG ISO 

Consensus C-CAGTTCTCT Cr A TTCTGAQ GCTCCTGCGG CMTGCCNGCG TeSACTTCCCG ISO 



RS5983/genbank. 
//hcad-6 cDNA 
Tha PCR fragment 
hbc3133 EST clone 
/'S ' /R6S382/genbank 
// S • /AA1S93 94 /genbarik 

.Consensus 



TGTGTGOGGG AQGGAACTCT QGGCAGQCTG GTTrTCTrGG AATGTGTTEA 



TOTGTGCGGG AGGGAACTCT GGGCAGGCTG GTmCTTOG AATCJrGTTTA 
TOTGTGGNGG AOGGAACTCT GGGCAGOCTG GTTrTCrrGG AATSTGTTEA 

TGTGTGSNGQ AGGGAACICT GGGCAGGCTG GITrrCTTCH AATtSTGTTTA 



191 



166 
200 

200 



^R559a3/genbank. 

i7 hced-6 ciSTA 
The PCR fragment 
hbc3X23 EST clona 

J S ' /R65a02/gehbank 

^ 3' /AAlS9394/genbank 

Consensus 



XfcTTGA AACGQGNRAC CGOQCCNCTG CAAGCKGGAA 

CGATGTTGAA TCOGPlCTTG- AACAGGAAGC T-GGACQCTG C-AQCTGGAA 



CGATGTTCAA TGGGACTTG- AACAGGAAGC T-QGACGCTG C-AGCTGGAA 
CGATQTTSAA TGGGACTTG- AACAGGAAGC T-GGACGCTG C-AGCTGGAA 

CGATGTTCAA TGGGACTTGA AACBGGNARC YGGGMCNCTG OIAGCNGGAA 



35 
244 



213 
247 

250 



/y R65983 /genbank . 

^ hced-6 CDNA 
The PCR Cragaienc 
hbc3l23 EST clone 

iJ'S ' /R6S882/genbank 

// 5 • /AA159394/genbank 

Consensus 



CTACCGTGCC CAAGTTATTr ATGAt4COCCA CCT3ATAIAC ATGGGAGAGA 65 
CTAGCGTG-C CAAGTTATIT ATGA-TTCCA TCTtSATATAC ATAGGAGAGA 292 



CTAGCGTG-C CAAGTTATTT ATGA-TTCCA TCTGRTATRC ATAQGAGAGA '261 
CTAGCGTG-C CAAGTERITT ATGA-TTCCA TCTGNTATAC ATAQGAGAGA 295 

CTASCGTGCC CAAGTTATIT ATQANyTCCA yCTC^flATAC AIRGQAGAGA 300 



^R6S983/genbank. 

^hced-6 cONA 

JlTSiia PCR fragment 
hbc3123 EST done 
5 VRS5882/genbank 

^ 5 ■ /AAlS9394/genbank 

Cansmi3%XB 



AAC-TOATAG AAGAATTCTG ATGGCAACTG TATGATAGAA GCTA-TATAA 
AACT-GATAG AAGAATTCTG ATGGCAACTO TATGATAGAA GCTATTATAA 
-«/J^VAGAAnCTG ATGGCAACTG TATGATAGAA GCTATTATAA 



AACT-GATAG AAGAATTCTQ ATGGCAACTG TATGATAGAA CCTAT-ATAA 
AACTTGATAG AAGAATTCTG ATOGCAACTO TATGATAGAA 6CTA-TATAA 

AACTTGATAG AAGAATTCTG ATGGCAACTG TATGATAGAA GCTATTATAA 



.133 
341 
40 

309 
344 

3S0 



yrR659B3/genbank. 
^hced-6 craxA. 
//The PCR fragment 

hbc3l23 £ST clone 
^ 5 * /R6S882/genbank 
^ S * /AA159394/senbank 

Consensus 



AGTCAAGTGT CCATTTTCTT TCAACTATAT TTOAGCATAC CCAQGATTTA 
AGTCAAGTGT CCATPTTCTT TCAACTATAT TTGAGCATAC CCAGGATTTA 
AGTCAAGTGT CCATTTTCTT TCAACTATAT TTGACCATW: OCAGGATTIA 



AQTCAAGTGT CCATTTTCTT TCAACTATAT TTGAGCATAC CCAQGGTTEA 
AGTCAAGTGT CCATnTCTT TCAACTATAT TTGAGCATAC CCAGGATTTA 

AGTCAAGTGT CCATTnCTT TCAACTATAT TTGAGCATAC CCAGGRTTEA 



183 
391 
90 

359 
394 

400 



^^^65 983 /genbank . 
//hced-6 cENA 
./The PCR fragment 

hbc3123 EST clone 
jys • /R6S8a2/genbank . 
//S • /AAlS93g4/gcnbank 

Consensus 



AGTCQTGGAA CTGAACATTT A'l'nUCTGA TCCTC ATCAT -GAACCGTGC 232 

AGTOQTGGAA CTGAACATTT ATTTGGCTGA TOCTCATCAT G-AACOGTGC 440 

AOTCaTGGAA CTQAA6ATTT ATTTGGCTGA TOCTCATCAT G-AACCGTGC 139 

AGTCOTOGAA OTGAACATIT ATTTOGCTCA TCCTCATCAT GGAACCGTGC 409 

AGTCGTGGAA CTGAACATT . AT 415 

AGTCGT GG AA CTGAACATTT "aTTTGQCTGA TCCTCATCAT GGAACCGTXSC 4S0 
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Figure 28B 
hced-6 alignment Formatted Alignment 



^F659a3/genban)c. mTAGCAGQ AAGRAAGACA AAACATGGAT QCATACACCT GAA£3CTTIAT 2B2 

^hced-6 CDtJK TTTTAGCAGG AACAAAOACA AAACA.TGGAT GCATACAOCT GAAGCTTTAT 490 

PCR Eragmafit TTTTKXACG AAGRAAGACA AAACATGGAT QCA.TACAOCT GAAGCTTTAT 139 

hbc3123 EST clone 

^5*/R65a82/genbank rrmGCAOG AAGAAAGACA AAACATGGGT GCTUACACCT GAAO-NTTAT 458 

S'/AAlS9394/genbanlc r 415 

Consensus TTTTAGCAGa AAGAAAGRCA AAACATGGRT GCWNACACCT GAAGOmAT 500 

<f RfiSSaa/genbank. CAAAACATCT CATTCCCEAT AA.TQCAAAGT nCTTGGCAG TACAGAAGTQ 332 

^•hced-€ CENA CAAAACATTT CA3TCCCEAT AATOCAAAGT ^TTCTIGGCAG TACAGAAGTG 540 

// The PCR fragmenC CAAAACATTT CATTCCCTAT JUVaXXSO^AGT TICTTGGCAG TACAGAAGTG 239 

hbc3123 EST clone — — — 

rf'S'/R6S882/geiibank CAAAACNTT- C-TTTCC NAT TT r— . 478 

5VAA159394/genbank 415 

Consenaua CAAAACNTTT CATTYGCCAT NATGCAAJU3T TTCTIGGCAa TACAGAAGTG 550 

^RS59e3/geribanJc. GAACAGCCAA AAGGAACAGA AfSTIGTGAGA GATGCTQTAA GGAAACEAAA. 382 

hced-S cCNA GAACAGCCAA AAGGAACAGA AGTTQTGAGA OATGCTGTAA GGAAACTAAA 590 

// Ttie pen fragaenc GAACAGCCAA AAGGAACAGA AGTTGTGAGA GAOXSCTGTAA GGAAACTAAA 289 

hbc3123 EST clone 

5'/R65a82/genbank 478 

5'/AAl593S4/genfc5n]c 415 

Consonsus GAACAGCCAA AAGGAACAGA AJTTIGTGAGA GATGCTGTAA GGAAACTAAA 600 

yR65983/genbank. GTTTOCAAGA CAaWTCAAGA AATCTGAAGG CCAAAAAA = 420 

hced-6 CDNA GTITQCAAGA CATATCAAGA AATCTGAAGG CCAGAAAATT CCTAAAGTGG 640- 

^The PCR fragment GTTTOCAAGA CATATCAAGA AATCTGAAGG CCAGAAAATT CCTAAAGTGG 339 

bbc3123 EST Clone 

5VR65882/genbank 478 

5'/AAl59394/genbank 415 

Consensus GTTTSCAAGA CATinC2U\GA AATCTGAAGG CCARAAAATT CCTAAAGTGG 650 

R659B3/genbank. =- AA ^AAAAA AG : 429 

^hced-6 Cim AGTEGCAAAT ATCAATTTAT GGAGTAAAAA TTCTAGAACC CAAAACAAAG 690 

ifTtia PCR fragment AGTTGCAAAT ATCAATTTAT GGAGTAAAAA TTCTAGAACC CAAAACAAAG 389 

hbc3123 EST clone , 

S'/R65aa2/genbank 478 

S VAAlS9394/g6nbank 415 

Consensus AGTTaCAAAT ATCAATITAT GGAGTAAAAA TTCT7VGAACC CAAAACAAAG 700 

R65983/genbank. r 429 

^hced-6 cDNA GAAGriTCAAC ACAATTGCCA GCTTCATAGA ATATCTTTTT GTGCAGATQA 740 

jfirTha PCR fragment GAAGTTCAAC AdAATTGCCA GCTTCATAGA AMCTTnT GTGCAGATaA 439 

itf'hbc3l23 EST Clone /JtyyvrroccA gcitcataga ATATcmrr gtgcagatga 39 

S'/RfiS8a2/genbank 470 

5'/AAl59394/genbank 415 

CCns^tSUS GAAGTrCAAC ACAATTGCCA GCTTCATAGA ATATCmTT GTGCAGATGA 750 

R659a3/gGnbank. 429 

^hced-6 tiDtOL TAAAACIGAC AAGAGGAn^VT TCACITTCAT ATGCAAAGAT TCTQVQTCAA 790 

//Tha PCR fragment TAAAACTGAC AAGAGGATAT TCACTTTCAT ATGCAAAGAT TCTOAGTCAA 489 

ff hbc3123 EST clone TAAAACTGAC AAiSAGQATAT TCACTTTCAT ATGCAAAGAT TCTGAGTCAA 89 

5»/R65a82/genbank . 478 

5'/AAlS9394/gonbank 415 

Consensus TAAAACTGAC AAGAGGATAT TCACmCAT ATGCAAAGAT TCTGAGTCAA 800 

Rfi59a3/genbank. ' 429 

^^iced-6 CDNA AO'AAACATTr GTGCTATGTA TTTGACAGCG AAAA gTGTG C TGAAGAGATC 840 

yTlie PCR fragment ATAAACAITT GTGCTATGTA TTTQACAGCO AAAAffTCTTGC TGAAGAGATC 539 

//hbc3123 EST clone XTAAACATTT GTGCIRTGTA TTTSACAGCG AAAAGTGTGC TGAAGAGATC 139 

S « /R6S 882 /genbanJc 47 6 

S * /AA1S93 94/genbanic 415 

Consensus ATAAACAITT GTGCTATGTA TTTGACAGCG AAAAGTGTGC TGAAGAGATC 850 

R65983/genbank. 429 

^ced-6 cDNA - ACTTTAACAA TTOGOCAAGC ATTTGACCTG GCATACACGA AATnCTAGA 890 

tfUhe PCR fragment ACTTTAACAA TTGaCCAAGC ATTTGACCTQ GCATAC\OGA AATTTCTAGA 589 

//hbc313a EST clone ACTTTAACAA TTOGOCAAGC ATTlw^^-Cra GCATACACGA AATTTCTAGA 189 

's*/R6S882/genbank r 479 

5'/AAl59394/gcnbank t 415 

Consenstis ACTTTAACAA TTGGCCAAGC ATTTGACCTQ GCATACACGA AATTJXrrAGA 900 
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R65983/genban}c. 

yTbe PGR fragment 
^ h]3c3123 EST clone 
5*/R65882/genhan)c 
5 ■ /AAl59394/genbflnk 

Consensus 

Rj65983/geribank. 
^ hced-6 cDNA 

Tha PCR fragment 
^ hbc3l23 EST clone 

5 VR6SaB2/giBnbank 

S ' /AiaS9394/genbank 

Consensus 



Figure 28 C 

hceci-6 alignment Formatted Alignment 



ATOVGGAGGA AAAGATGTTQ AAACMGAAA ACAGATCGCA. GGGTTACAAA 

ATCAGGAGGA AARGATOTTQ AAACAAGAAA ACAG 

A1CAGGAGGA AAAGATGTTCG AAACAAGAAA. ACAGJVTCGCA GGCTiTACAAA 



ATCAJGGAGGA. AAAtSATSTIG AAACAAGAAA ACAGATCGCA. 'GGGTTACAAA 



AAAtSAATCCA. AGACTTAGAA ACAGflAAATA TOGAACTTAA AAATAAAOTA 
AAASAATGCA AGACTISKGAA ACAGAAAATA TGGAACTTAA AAAHAAAlQTA 



429 
940 
623 
239 
478 
415 

950 



429 
990 
623 
289 
478 
415 



AAAOAATCCA AGACTTAGAA ACAGAAAATA. TGGAACTTAA AAATAAAGTA 1000 



R65983/genbank, 

hced-6 cONA 

The PCR fragment 

hbc3123 EST clone 

5*/Rfi5882/genbank 

S • /AAl59394/genbank 

Consenatis 



429 
1040 
623 
339 
478 
415 

CAAGAI'riQQ AAAACCAAGT a'/SAATAACT CAAOTATCAG CACCTCCAGG 1050 



CAAGATTTGG AAAACCAACT GAOAATAACT CAAOTATCAG CACCTCCAGC 
CAAGATITCSG AAAACCAACT GAGAATAACT CAAGCAICAG CACCTCCAGC 



R659e3/genbank. 
hced-6 cDNA 
The PCR fragment 
hbc3123 EST clone 
5 VR6Saa2 /genbank 
5 ' /AAlS9394/genbank 

Consensus 



429 
1090. 
623 
389 
478 

415 

AGGCAGTATG ACACCTAAGT CGCCCTCCAC TGACATCTTT GATATGATTC 1100 



AGGCAQTATG ACACCTAAGT CGCCCTCCAC TGACATCTTT GATATQATTC 
AGGCAGTATG ACACCTAAGT CGCCCTCCAC TGACATCTTT GATATGATTC 



R65983/genbank. 
hced-6 dDNA 
The PCR fragment 
hbc3123 EST clone 
5 * /R6S882 /genbank 
5 • /AAl59394/genbank 

Consensus 



429 
1140 
623 
439 
478 
415 

CATmCTCC AATATCACAC CAGXCTTCGA TGCCTACTCG CAATGGCACA 1150 



CATmCTCC AATATCACAC CAGTCTTCGA TGCCTACTCG CAATGGCACA 
CATTTICTCC AATATCACAC CAGTCTTCGA TGCCTACTCG CAATQOCACA 



R65983/genbank. 
hced-6 cONA 
The PCR fragment 
bbc3l23 EST clone 
5 ' /R658B2 /genbank 
5 ' /AAlS9394/genbank 

Consensus 



429 
1190 
623 
489 
478 
415 

CAOCCACCTC CAGXACCTAG TAGATCTACT GAGATTAAAC GQGACCTGTT 1200 



CAGCCACCTC CAGTACCTAG TAGATCTACT GAGATTAAAC GGGACCTGTT 
CRGCCACCTC CAGTACCTAG TAGATCTfJCT GAGATTAAAC GGGACCTGTT 



R6S9 8 3 /genbonk . 

hced-6 cONA 

The PCR fragment 

hbc3i23 EST clone 

5'/R6S8a2/gettbank 

S ' /AA159394/genbank 

Consensus 



TCGAGCAGAA CCmTGACC CATTTAACTG TQGAGCAGGA GATTTCCCTC 



429 
1240 
623 
539 
478 
415 

TCGAGCAGAA CCnTTQACC CATITAACTG TGGAGCAGCA GAnTOCCTC 12S0 



TGGAGCAGAA CCmTGACC dOTTPAOtO TGGAGCAGCA GATTTCCCTC 



R6S9a3/genban)c. 
hced-6 cISJA 
The PCR fragment 
hbc3l23 EST clone 
S*/R6Se82/genbank 
5 • /AA1S9394 /genbank 

Consensus 



CAGATATTCA ATCAAAATTA GATGAGATGC AGGAGGGGTT CAAAATOGGA 



429 
1290 
623 
589 
478 
415 

CAGATATTCA ATCAAAATEA GATCAGATGC AGGAGGGGTr CAAAATGGGA 130T3 



CAGATATTCA ATCAAAATTA GATGAGATGC AGGAGGGGTr CAAAATGGGA 



R6S9 83 /genbank. 
hced-G cDKA 
The PCR fragment 
hbc3123 EST clone 
5*/R6S882/genbank 
5 VAAl59394/genbank 

Consensus 



CTAACTCTTG AAGOCACAGT AITITGTCTC GACOCGITAG ACAGTAGGTG 



CTAACTCTTG AAGGCACACT ATmOTCTC GACCCGTTRG ACAGrAGCTTO 
CTAACrCTTQ AAGOCACAGT ATnTGTCTC GACCCGITAC ACAGTAGGTG 



429 
1340 
623 
639 
478 
415 
1350 
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R6S983/genbank. 
hced-6 cDNA 
•ThQ PCR fragment 
hbc3123 EST clone 
5 ' /R65882/geribank 
S * /AA1S93 9 4 /genbank 

Consensus 

R65983/genbanJc. 
hced-6 CDNA 
The PCR fragment 
hbc3l23 EST clone 
5 ' /R65882/genbanlc 
5 • /AAlS9394/genbank 

Consensus 
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hced-6 alignment Formatted Alignment 
"ir™ — *29 

CTOACATCAA GAACAAGAAA TOCTGATXCA TC3TTAAATGT CrTTGTATAC 1390 

CTGACATCAA GAACAAGAAA TCCTGATTCA TGTTAAATOT OTTIGTATAC 689 

CTOACATCAA GAACAAGAAA TCCTGATTCA TGTTAAATOT GmOTATAC 1400 



PCTAJS99/01361 



429 
1440 
623 
739 
478 
41S 

ACATGTCATT TATTATTATT ACTTTAAGAT AGGTATTAIT CATQTGTCAA 1450 



ACATGTCATT TATTATTATT ACTTTAAGAT AGGTATXATT CATGTGTCAA 
ACATGTCATT TATTATTATT ACTTTAAGAT AGGTATTATT CATGTGTCAA 



R£Sga3/genbank. 
hced-6 CDNA 
The PCR fragment 
hbc3123 EST clone 
5 • /R65882/gen]3anJc 
5 • /AAlS9394/genbank 

Consensus 

R65983 /genbank. 
hced-6 cONA 
The PCR fragment 
hbc3123 EST clone 
S * /R6S 882 /genbank 
5 • /AAl59394/genbank 

Consensus 



R65 983 /genbank. 
hced-6 cDNA 
The PCR fragment 
hbc3123 EST clone 
5 * /R65B82/genbank 
S - /AA159394 /genbank 

Consensus 

Re59a3/genbank. 
hced-6 CDMA 
The PCR fragment 
hbc3123 EST clone 
S ' /R65882/genbank 
S • /AAl59394/genbank 

Consensus 



429 
1490 
623 
789 
478 

TGTTTTrGAA TATTTTAATA TTTTGAAAAT TnCTCAGTT AAATTTCCTC 1500 



TGTTTTIGAA TATTTTAATA TTTTGAAAAT TTTCTCAGTT AAATTTCCTC 
TGTmTGAA TATTTTAATA TTTTGAAAAT TTTCTCAGTT AAATTTCCTC 



ACCTTCACTA TTGATCTGTA ATnTTATTT TAAAAACAGC TTACTTGTAAA 
ACCTTCACTA TTGATCTGTA ATiTITi^TTT TAAAAACAOC TTACTGTAAA 

ACCTTCACTA TTGATCTGTA ATnTTATTl' 'IAaAAACAGC TTACT'CTAAA 

CTAGATCATA CTITTATGTP CCTTrCTGTT TCTACTGTAG ATGAATTTGT 



GTAGATCATA CTl ' lT ATGrr CCTTTCTGTT TCTACTGTAG ATGAATTTGT 

GTAGATCATA CTTTTATGTT- CCTrTdUIT TCTACTGTAG ATGAATTTGT 

AATIGAAAGA CATATTATAC AAATACCTGC CTTGTCjTCTG AGTICTATrT 
AATTGAAAGA CATATTATAC AAATACCTGC CITGTGTCTG AGTrCTATTT 



429 
1S40 
623 
839 
478 
415 

1550 



429 
1590 
623 
889 
470 
415 

1600 



429 
1640 
623 
939 
478 
415 



AATIGAAAGA CATATTATAC AAATACCTGC CTTGTGTCTQ AGTTCTATTT 1650 



R6 59 83 /genbank. 

hced-6 cDNA 

The PCR fragment 

hbc3123 EST clone 

S'/R65882/genbank 

5 • /AA15 9394 /genbank 

Consensus 



AGTTAQCATC TTGAAATTTG TATTCATTIT CCAGATGGCT AGTTTATEAA 
AGTTAQCATC TTGAAATTTG TATTCATTTT CCAa^TGGCT AGTTTATTAA 



429 
1690 
623 
989 
478 
415 



AGTTRGCATC TTGAAATTTG TATTCATTTT CCAGATGGCT ACTTTATrAA 1700 



R6S983/genbank. 
hced-6 cCKA 
The PCR fragment 
hbc3123 EST clone 
5 '/R6S 882 /genbank 
5 • /AA159394 /genbank 

Consensus 



429 
1740 
623 
1039 
478 
415 

OCATTTOCCA AAAGCCATAC CTTAAAGATA ACmTTAAA TTCTQAAGAG 1750 



TGATTTCCCA AAAGCCATAC CTTAAAGATA ACmTTAAA TTCTGAAGAG 
TGATTTCCCA AAAGCCATAC CTTAAAGATA ACTTITTAAA TTCTQAAGAG 



R659a3 /genbank. 
hced-6 cDtCV 
The PCR fragment 
hbc3123 EST clone 
5 VR658 82 /genbank 
S • /AAi59394/genbank 

Consensus 



429 
1790 
623 
1089 
478 
415 

ACATGCCAAT GTCAAACTAA ACATGTTCTG TTTTTAAACC AACAAACATG 1800 



ACAIGCCAAT GTCAAACTAA ACATGTTCTG TTTTTAAACC AACAAACATG 
ACATGCCAAT GTCAAACTAA ACATGTTCTG I'l'l'lTAAACC AACAAACATG 



Figure 28D 
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R6S9B3/ganbank. 

hced-€ cDNA 

'ttiB PCR fragment 

hbc3123 EST clcnfi 

5*/R65aB2/genbank 

5 • /AAl59394/genban5p 

Consensus 



Figure 28E 

hc8d-6 alignment Formatted Alignment 



TIACTATTCA TTGaACAGAT ATCATTTTAT GTATAAATAG TGTICACAIC 



.429 
1840 
623 
1133 
478 

TEACTATTCA TTGGACAGAT ATCATnTAT GTATAAATAC teTTCACATC 18S0 



TTACTATTCA TTGGACAGAT ATCATTTCAT GTATAAATAC TGTICACATC 



R6S983/genbank. 

hced-6 cENA 

The PCR fragment 

hbc3123 EST clone 

5*/R65882/genbank 

5 ' /AAlS9394/genbank 

R55983/genbanlc. 
head- 6 cENA 
Tha PGR fragment 
hbc3123 EST clone 
S * /RfiSaaa/genbank 
5 • /AAl59394/genbahk 

Consensus 

R659B3/genbank. 
hced-6 cDNA 
The PCR fragment 
libc3123 EST clone 
5 * /R658e2/genbanlc 
S • /AAl59394/genbank 

Consensus 

R659a3/genbank. 

hced-6 cDNA 

Ihe PCR fragment 

hbc3123 EST clone 

5'/R65882/genbaiik 

5 ' /AAlS9394/genbank 

Consensus 



ACTGGGAAAA TGTAAACITP AAACATAATQ CCACAAGGTC ACTAATTICT 



429 
1890 
*623 
1189 
, 478 

ACTGGGAAAA TGTAAACrTT AAACATAATG CCACAAGGTC ACTAATTTCT 1900 



ACTGGGAAAA TGTAAACTTT AAACATAATG CCAGAAOGTC ACTAATTTCT 



429 
1940 
623 
1239 
478 

AGCAGGTAAA ATTATAAQGA TATAAATTCC AATaAx7AAC CAAATGTATT 1950 



AGCAGGTAAA ATTIATAAI3GA TAfEAAATTCC AATAATAAAC CAAATtSTATT 
AGCAGGTAAA ATTATAACa^ TATAAATTCC AATAATAAAC CAAATGTATT 



TAGAGTATTT ATEAGTAAAT GCAAGGTGAT GTTAGTTRTG ATCAGTTATA 
TAGAGTATTT ATTAGXAAAT GCAAGGTGAT GTEAGTEATG ATCACJITATA 



TAGAGTATTT ATTAGTAAAT GCAAKTIGAT GTTAGTTATG ATCAGTTATA 

CTCTRAATAT TTAATTTGTT TTATAAAGGT AGTGAAAAAA TGAAAATTTG 
CTCTAAATAT TTAATTTGTT TTATAAAGGT AQT3AAAAAA TGAAAATTTG 



429 
1990. 

623 
1289 

478 

415 

2000 



429 
2040 

623 
1339 

478 

41S 



CTCTAAATAT TTAATITCnT TTATAAAGGT AGTGAAAAAA TGAAAATTTG 2050 



R659a3/genbank. 

hcad-6 cJJNA 

The PCR fragment 

hbc3123 EST clone 

S*/R658a2/genbank 

5 ' /AAl59394/genbank 

Consensus 

R6S983/genban3c. 
hced-6 cWPi, 
Tba PCB Cragoenc 
hbc3123 EST clone 
5VR65882/genbank 
S • /AA159394/genbank 

Consensus 



CTATTTATIA AAAAACATTA AATTTCATTC CAAATGAGAT AAGTGATATT 
CTATTEATTA AAAAACATTA AATTTCATTC CAAATGAGAT AAGTGATATT 



CTftTTTATTA AAAAACATTA AATTTCATTC CAAATGAGAT AAGTGATATT 
ACTAIAACAT CTAAGCATCA TCTQATTTGA TAnCCCTAA AAAACATTTG 



ACTATAACAT CTAAGCATCA TCTGATTTGA TATTCCCTRA AAAACATTTG 
ACTATAACAT CTAAGCATCA TCTGATTTGA TATICCCTAA AAAACATTTG 



429 
2090 

623 
1389 

478 

415 

2100 



429 
2140 

623 
1439 

478 

415 

21S0 



R65983/gaibank, 

hced-6 cOttA 

The PCR fragment 

hbc3123 EST clone 

S*/R6Saa3/genbank 

S • /AAlS9394/genbank 

Consensus 



R6S983/genbank. 
hced-6 CDMA 
The PCR fragment 
htac3l23 EST clone 
5 * /R658a2/genbank 
5 * /AAlS9394/genbank 

Consensus 



GAATATATGC TATCTATAGA TTCAGTATCT ACTACOCATA TTEACTmr 
GAATATATGC TATCTATAGA TTCAGTATCT ACTACCCATA TTEACTTTAC 



GAATATATGC TATCTATAGA TTCAGTATCT ACTACOCATA TTEACTTTAC 

CAAATATATT TCTCCTCACT GCATAAGGAC TACICTTCTC ATATITTCTr 
CAAATATATT TCTOCTCACr GCATAAGGAC TACTCTTCTC ATATXTTCTT 



429 
2190 

623 
1489 

478 

415 

2200 



429 
2240 

623 
1539 

478 

415 



CAAATATATT TCTCCTCACT GCATAAGGAC 'ISVCTCTTCTC ATATriTCTT 2250 
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Figure 28F 

hced-6 alignment Formatted Alignment 

R659a3/genbank. — 

hcad-6 CDNR. CrrXGATCAA QATATTTTIC ACCfiAAGTIT ATiTiUlUKT QCCCTCTTGG 2290 

Tha PCa Cragment . — g23 

hbc3123 EST done CTTIGATGRA GATATXTnC AjDCAAAGlTT ATXTTGrQAT GCCCIOTTOQ 1569 

5VRfi5a82/genbaTik 473 

5'/AAlS9394/genbanJc -r 415 

Conaensus CTITGATCS^A GATAirmC ACCAAAGTIT ATinuitiAT QCCCTCTTGG 23 OQ 

R6S983/«enhank. " 429 

hced-6 CONA TiTlUATACT TTAAAATCTG TGGCACCCGT TCTRCATGAA TIATCAATAT 2340 

*Sh& PCR fragment — : — ~ — " "^ 623 

hbc3l23 EST clona TnTGATACT TTAAAATCTG TGGCACCCC?r TCTACATGAA TIATCAATAT 1639 

5'/R658e2/genban3c 478 

5VAAl59394/genbank 415 

Consensus TrntSATACT TTAAAATCTC TCJGCACCCGT TCTRCAT3AA TXATCAATAT .2350 

R65983/genbank. . 429 

hced-6 caJNA TIGGTAAATT CAATCTGTAT TrGTtTrGTT AAAGTCAAAA ATCTCATITT 2390 

tha PCR fragment — Z1Z~ — ^23 

hbc3123 EST clone TrGGrCATVATT CAATCTOTAT TTGTrTTGTT AAAGTCAAAA ATCTCATITr 1689 

5VR65a82/genbank ' 478 

5VAAlS9394/oanbank *- 415 

Consensus TIGGTAAATT CAATCTOIAT 'i'lViTi'iWiT AAAGTCAAAA' ATCTCATTTT 2400 

Rj6S9a3/genb2uik. . 429 

bced-6 CDtIA CCAAAAAAAA AAAAAAAAAA AC 2412 

The PCR fragment 623 

hbc3123 EST clone CCAAAAAAAA AAAAAAAAAA AC 1711 . 

S*/R65882/genbank 478 

S'/A\159394/genbank 415 

Consensus CCAAAAAAAA AAAAAAAAAA AC 2422 
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Figure 29 

Untitled-5 



Formatted Alignment 



hCED-6 original 
hrm- 6 / correc ced 

Consenstis 



hCED- €/ correc ted 
Consensus 



original 
hCED-6 / corrected 

Consensus 



hCED*6 origixial 
hCED- 6 / correct ed 

Consensus 



hrFD-6 original 
liCED-6 /corrected 

Consensus 



hCED-6 original 
hCED-6 /correc ted 

Consensus 




SO 
50 

SO 

100 
100 

XOO 

150 
ISO 

ISO 



200 
200 

200 



250 
250 

250 



300 
300 



hCEZ>-6 original 
hCED- 6 / correc ted 

Consensus 



304 
304 

304 
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Figure BOA 

Untitled-9 Formatted Alignment 



heed- 6 cDNA/codlno reg. 

hced-6 cDNAycoding reg. /correcce 



MXGAACCGTG CmTWCAC GAAGAAAGAC AAAACATOGA TGCAXACftOC 
niGnACCGTG CmXAGCAG (aAAOnAAGAC AAAACATOGft ^TOCATBrBCT 



y-*TT»rt»T>/-v^n/r r^arr^jtuf^a^ a&aBr*nrnrsrra *TryaTnr-*ftriH 



50 
SO 



bced-s 
heed- 6 



cDMA-Zcodina reg. 

cDNA/ coding reg. /correcce 



IXSAAGCTTTA 1CAAAACATT TCJU.'1UJCTIW- XftATGCaAW* 'X'i'l'UT'IVGC ft 
TDSftRGCrriaV. TCUUIACWIT ICAXITCGCnA ^CASGCJULAC- TTICrrSQCft 
'nffaftry^i' fj * ri^n&any^*T«Ti- frr^aornY-v'*-'^ Tn nnriT'ti. nne* ^l-'-jy r-ixl. 



100 
100 

100 



head- 6 cDNA/ coding reg. 

hced-6 cQMA./coding reg. /correcce 

Consensus 



gEaCROS^AGT GGRACAGGCA. AAAGGftACR& AA GlTCrGa g .j^Ga ^ULTiXji^rt 
SERCAGAAGT GSAACAGCC2W AAAOQAACAa AA^l'iyiliAa/ 



ISO 
150 

150 



hced-6 cDHA/ coding reg. 

hced-6 cCNA/ coding reg. /correcce 

Consensus 



hced-6 cDNA./ coding reg. 

hced-6 cDNA/coding reg. /correcce 

Consensus 

head- 6 cD^^^/ coding reg, 

hced-6 cDecU coding reg. /correcce 



XQCXftAAGTO GMSrrocaUUVi TMCaftTITR.: Tscau?^^ 



Consensus 



. GGaflGiTCAA- cftcaJorrGcc. a gcttca xrc AAT Aiitrrrrl 

> OG&AGTTCAA CACAAXTGOC-AQCTrcATAGT. AA33VXCrrn 
ftKf-'^Ra fjirr^arrTT v^T^n r^a^aa'AW^ &ry~T»iv^a«rafr &ai'n«TyT*T»r*T< 



200 
200 



2 SO 
250 



300 
300 

300 



hced-6 COMA/ coding reg. 

hced-6 cCNA/ coding reg. /correcce 

Consensus 



XGtlGCAGAItS ATAAAACTGA CAACAGGAIA. ^CA^CTTC^' TATCCTARCfi 
tCrtpCSiCXra ATAAAACTGA^ptRGAOSSA'Bl fTTCACTEIC^ 
i V jrTy-Vf-r5*rir! ^Tr^aat-r- n rsa r'aar;;*faT^'T«R" *Tnir*a/-«W't<f-'5* TOTyanaa^fza 



350 
350 

350 



hced-6 cXaiA/coding reg. 

hced-6 CDMA/ coding reg. /correcCe 

Consensus 

hced-6 CDKA/ coding reg. 

hced-6 cONA/ coding reg. /correcce 

Consensus 

hced-6 cDMA/coding reg. 

hced-6 cDNA/ coding reg. /correct e 

Consensus 

hced-6 cCNA/coding reg, 

hced-6 cCNA/ coding reg. /correcce 



rrCTGAgrCA AAaftAACATT TSTGCIRTGT ATEPGR CAGC GAAAA GTCTTG 
EPCiKaVGTCA AATAAftCATT TGTGCTATGT AirCTOACAGC^'GRAMCTOTG 
aaWarrPi'w >Tyynry TTwmryTT an^myrfcr^ary^ r-ga a arrtvynrrJ 



gTGAAGftGAl' CACTl'iaACA AaCTCSOCCAAg C ATSTC ACCT Q 



AAATTTCXAG AAJCAGGAGG AAAAGflaXnTT GAAACAAGaA AACaSATTCC 
KSATTTCEAG AATCAGGAOG ' AAAflGATGrT GAAACAACMt AJQStfSMqaC 

It;^.^ - , • . -'T- • '</■*• - ■ ■ '• *^ 

iraa'ir^i-TT'T.ari naTv-arys'r- y y aaaB,rt^'*v^yrr^l rrAAar-^armB. a]l.r^f3VTmTrt 



ftGGGTTACAA AAAAGAATGC AAXSA CTTA GA AACAGAAAAT. A TCGa A CITA 
aGGGTTACAA AAAAGAATCC AAGACTTAGA AACACJVAAJVI^ ATOG^^ 
"fniwrnf - t^ aaaarrymTT^ aar??t>^ftfrn B-erif^aftBn^- B'TlTOnr-T^ 



400 
400 



450 
450 



500 
500 



550 
550 

550 



hced-6 cDNA/ coding reg. 

hced-6 cDNA/coding reg. /correcce 

Consensus 

hced-6 COMA/ coding reg. 

hced-6 cONA/ coding reg. /correcce 

Consensus 

hced-6 cO&lA/ coding reg. 

hced-6 cONA/ coding reg . /correcce 

consensus 



heed 

heed 



-6 cOHA/ coding reg. 

-6 CDNA/coding reg . /correcce 



Consensus 

hced-6 cOCIA/ coding reg. 

hced-6 «»lA/coding reg- /correcce 

Consensus 

hced-6 cDNA/ coding reg. 

hced-6 cEKA/ coding reg. /correcce 

consensus 



RAMVIAAAGr ACAAGATTXG GAAAACCAAC lTSAGAA33iAC :-2CAAGrEATCA 
WWiWOEAAJSr ACAAGATTTG GAAAAOCAAC TGASMXl>>ML ;^t^C^K£CI< 
^if^y^m^nr^ R^ n T;/-^y.p>mr->- j-rt jvnary^ff^ ^marra aT-HBf- TT^ai- i^n-ir'-a 



aCAOCICCAG CBOOCACTBT GACACCTAAG 'rLXj LXA.ICC A- pTGftCA ICIT 

raccTCCAO <:^>a3Ci>ssraa gacaoctaac tcocccicca ctga^icti 



roATftJCGAa-i' CL:Aii'i'rn-ii, caatatcaca cca gicttcg a iiujcxa cic 
TSUTKBOMcr ccanncTC caaiatcaca ocagtcttos atoocteactc 



3CAATGGCAC ACAGOCAOCT OCACXftCClA tfUAG A'l^'JA C TGAGAaXAAA 
SCAASOGCaC ACAGCCACCT OCAGTACCTA GTAGATCXRC IGAGATXKAA 
ry^aaTTyy^ar^ ar^ranparr^ rr^rm ff^^ rmnn^TT'n./- tm^zzrm.Z.L. 



OGQGACCEGT TIGGAGCAGA ACCTTTTGAC CCA JTTA ACT GTOGAGCAGC 
MGGACCTGT TTGOAGCAGA. AOClTTrGAC CCATTEAACT- OTGaBGCAGC 

>2Q_jQ^*2ZSC!!&AC£LGZGa&GCACC 



RGAmOCCT CCAGATATTC AAXCAAAATT AGA.TGAGATC CAGGAGGGGI 
AGAITTCCCT CCAGKCVITC AATCAAAATT AGATGAGATG CAOaAGGGCI 



600 
600 



650 
650 

650 



700 
700 

700 

750 
750 



800 
800 



850 
8S0 



wo 99/37770 



PCT/US99/01361 



37/49 
Figure 30B 



Untltled-9 Formatted Aiignment 



hced-6 cDNA/coding reg. 

cDtDV/coding reg. /correcca 

Consensus 

hced-6 cXStPJcoi5±ng reg. 
Consensus 




900 
900 

900 



915 
915 

915 
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cor.sensus 
Seq 

aalS9334 



consensus 
Seer 



coass-sus 



G3TGATGAGC 
. .TGATGAGC 



CCTTGGGTTC TCGCTCCGAC 
CCTTGGGT7C TCGCTCCGAC 
CCTTGGGTTC TCGCTCCGAC 



GGTGATGACC 
. .TGATGAGC 



CCTTGGGTTC TCGCTCCGAC 
CCTTGGGTTC TCSCTCCGAC 



CTCCACCTTC 
CrCCACCTTC 
CTCCACCTTC 
CTCCACCTTC 
C-TCCACCTTC 

ctccaccttc 

ccagttcxct 
gcj=;gttctct 
c-cagttctct 

CCAGTTCTCT 
CCAGTTCTCT 



TCGTGGCCTC 
TCGTCGCCTC 
TCGTC-GCCTC 
TCGTGGCCTC 
TCGTGGCCTC 
TCGTGGCCTC 



ACTCGCC\CA 
ACTCGCC?.C.Ol 
ACTCGCCACA 
ACTCGCCACA 
ACTCGCCACA 
ACTCGCCACA 



TGCTAAATTC 
TGCTAAATTC 
TGCTAAATTC 
.GCTAAATTC 
TGCTAAATTC 
TGCTAAATTC 



CGGATCAGAA 
CGGATCAGAA 
CGGATCAGAA 
CGGATCACAA 
CGGATCAGAA 
CGGATCAGAA 



50 

GCTTGGCCGG 
-GCTTCGCCGG 
GCTTGGCCGG 
GCTTCCCCGG 
GCTTGGCCGG 
GCTTGGCCGG 

TCCGGAGCAG 
TCCGGAGCAG 
TCCGGAGCAG 
TCCGGAGCAG 
TCCGGAGCAG 
TCCGGAGCAG 



CTATTCTGAG CCTCCTGCGG 
CTATTCTGAG GCTCCTCCCC 
CTATTCTGAG GCTCCTCCCC 
CTATTCTGAG CCTCCTGCCG 
CCTCCTGCSC 



C TCCC GCG CTGACTTCCC 

C TCCC GCG CTGACTTCCC 
C , TGCCGGC . TGACTTCCC . 
C.TCCCGGC. TGACTTCCC* 
OrrCCOIGCG TGACTTCCCG 



FIGURE 32A 
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aa3 6 97H GCAG-rrCTWT CTATTGTCAC CCTCCTKCGG CTGCCGCCC XGACTTCCC 



Chcll7484 

aal593S4 



151 

TcrrcTGCWGC 

TGTGTGCGGG 
T3TGTGCGGG 
TGTGTGCGGG 
rGTGTGGNGG 



aa3 6 971<l -WTGTGCC3GQ 



AGGGAACrrCT 
AGGGAACrCT 
AGGGAACTCT 
AGGGAAClCr 
AGGGAACTCT 
AG<5GAACTCT 



•GGGCACGCTG 
GCGCAGGCTG 
GGGCAGGCTG 
GGGCAGGCTG 
GCGCAGGCTG 
CjGGCAGGCTG 



GTTTTCTTGG 

gttt-tcttgg 
gttttcttgg 
gttttcttgg 
GTTTTCTTGG 
CTrrnrrTGG 



2QQ 

AATGTGTTTA 
AATGTGTTTA 
AATGTGTTTA 
AATGTOTTTA 
AATGTGTTTA 
AATCTCTTTA 



r6S9fl3rcc 
r£r5SS2 

aa3637i4 



zax 

CGAT.GTTGA 



CGAT GTTGA 



ATSGGACTTG AACASG, .AA 



2S0 

GCTGGACGCT GCA.GCTGGA 



CGAT . GTTGA 
CGAT . GTTGA 
CGAT . GTTGA 
CGATTC3TTGA 



prim«r 0GAI03 

primer oGAlQ4 
ATCCCACTTG AACAGG AA GCTGGACGCT GCA GCTGGA 
• v/..,CrrG AAACGGGKAA CCGGGCCNCT GCAAGCWGGA 
ATGGGACTTG AACAGG. -AA GCTGGACGCT GCA.GCTGGA 
ATGGGACTTG AACAGG.. AA GCTGGACGCT GCA.GCTGGA 
ATGGGACTTG AACAGG. .AA GCTGGACGCT GCA.GCTGOA 
ATGGGACTTG AACAGG, ,AA GCTG<3ACGCT GCA 



251 

corucxisus ACTAGCGTGC C.AAGTTATT 

Seq ACTAGCGTGC C AACTTATT 

3C-S5 9aa3rcc ACTACCGTGC CCAAGTTATT 

thcll^^a-i ACTAGCGTGC C.AAGTTATT 

x-SSSa^ ACTAGCGTGC C.AAGTTATT 



TATGATTCC- ATCTGATATA 
TATGATTCC ATCTGATATA 
TATCAirCCCC ACtTTGATATA 
TATGATTCC . ATCTGATATA 
TATGATTCC- ATCTGATATA 
aalS93 94 ACTAGCGTGC G . AACTTATT TATGATTCC. ATCTGinCATA 



300 

CATACCAGAG 
CATAGCAGAG 
CATCCCAGAX3 
CATMGAGAjS 
CATAGGAGAG 
CATAGGAGAG 



CO r.sansus 



CGA102 

chcllT+a-i 
r6S9S2 
aalS93 34 



2CL 

AAACT GATA GAAGAATTCT 



AAACT GATA GAAGAATTCT 



350 

GATGGCAACT GTATGATAG A AGCTAT AT 
primer cGAX02 

GATGGCAACT GTATGATAG AAGCTAT AT 
-TA 



AAACT, CjATA GAAGAATTCT 
APACT.GATA GAAGAATTCT 
AW»iCT.OATA CAAGAATTCT 
AAACTTGATA GAAGAATTCT 



GATGGCAACT .GTATGATAG AACCTAT.AT 

GATGGCAACT .GTATGATAG AAGCTAT. AT 

GATCGCAACT .GTATGATAG AAGCTAT. AT 

GATGGCAACT .GTATGATAG AAGCTAT, AT 



0GAIO3 

r6S9a3rcc 

tlicll74a4 
aal53394 



AAAGTCAACT 
CAAGTCA.GT 
PAAGTCAAGT 
AAAGTCAAGT 
AAAGTCAAGT 
AAAGTC.aua.Gr 



GTCC A TTTT C 
GTCCATTTTC 
GTCCATTTTC 
GTCCATTTTC 
GTCCATTTTC 
GTCCATTTTC 
GTCCATTTTC 



TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 



ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 



4Q0 

ACCCACGATT 
ACCCAGGATT 
ACCCAGGATT 
ACCCAGGATT 
ACCCAGGGTT 
ACCCAGGCTT 
ACCCAGGATT 



hCED-fi 



conaenaua TPtAGTCGTCC AACTGAACAT 

Se^ TAACrCCTGG AACTGAACAT 

CCA132 TAACTCGTGO AACTCAACAT 

ressaar-cc taagtcgtgg aactgaacat 

Chcll743-; TAACTCCTCC AACTGAACAT 

res 9 32 taagtcgtcc aactgaacat 

aal5g3 34 ' TAAGTCGTGG AACTGAACAT 



« N R 
4S0 

TTATTTCC CT CATCCTCATC ATG . AACCC T 

Primer 445 -1093^ -07-R 
TTATTTGGCT CATCCTCATC ATG AACCGT 
TTATTTCCCT CATCCTCATC ATG- AACCGT 
TTATTTGGCT CATCCTCATC ATG. AACCGT 
TTATTTGGCT CATCCTCATC ATGGAACCGT 
TTATTTGGCT CATCCTCATC ATGGAACCGT 
TAT 



CED-e 



MAKnrYKTFrC RSVSGIVCGV NINGSC-SSS? STSA5gVK-/R CCTG — 



FIGURE 32B 
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CED-ff 

hcro*€ 



CGJXS casus 

r£SSS3rc= 



coosensus 
Seq 
OC3AXQ2 
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A 
4Sl 
GCT 
CCT 



R 
K 



T 
T 



W 

w 



z 

M 



K P 
K T 



P 
P 



TTAGCA GGAAGAAAQA CAAAACATGG 
TTAGCA CGAAGAAAGA CAAAACATCG 
GCTTTTAGCA aOAACAAACA CAAAACATCS 
GCTTTTAGCA GGAAGAAAGA CAAAACATGG 



rTTAGCA GGAAGAAAGi 
rTTAGCA CGAACAAACA CAAAACATCG 
CCTTTTXSCA CCAAGAAAG-:^ CaJULACATGG 



ATCgATACAC 
ATGCATACAC 
ATGCILTACAC 
ATGCATACAC 
CTCCXKaCAC 
GTGCTKACAC 



- I L 

K K V 
SQQ 

CTCAACCTTT 
CTGAAGCTTT 
CTGAACCTTT 
CTGAAGCTTT 
CTGAAG-NTT 

CTGAAG-trrr 



IC 



G 
K 



K 
T 



V 
1 



F 



Y 



V 
N 



A 
A 



r 
r 



G 
G 



C 



V 
T 



5ICH TIP YNAX FLO STE 
5C1 SSO 

ATCAAAACAT TTCATTCCCT ATAATGCAAA CTTTCTTCCC ACTACACSAAG 

ATCAAAACAT TTCATTCCCT ATAATCCAAA CTTTCrrCGC ACTACAGAAG 

ATCAAAACAT TTCATTCCCT ATAATGCAAA CTTTCTTCGG ACTACAGAAG 



ATCAAAACAT TTCATTCCCT ATAATCCAAA CTTTCTTCGC AGTACAGAAG 

ATCAAAACN TTCTTTCCNA TTT 

ATCAAAAC.K TTCTTTCCilA TTT 



CCD'S 



TPXA KCS DVA RZAX KAX 
VSQP KGT EVV RDAV RKL 
S31 tfOO 

caexjB*r.cuJ5 TSCAJlCAGCC AAAAGGAACA GAAGTTGTGA GAGATGCTGT AAGGAAACTA 

Seer TGGAACAGCC AAAAGGAACA GAAGTTGTGA GAGATGCTGT AAGGAAACTA 

oGAlCx" TGGAACACCC AAAAGGAACA GAAGTTGTGA GAGATGCTGT AAGGAAACTA 

irSSSajrcC T-G^ACAGCC AAAAGGAACA GAAGTTGTGA GAGATGCTGT AAGGAAACTA 



CEO-S 



consensus 
OCAZC2 

r7e37a 



arc RDLX RSS ^TSISTAX L Q Xl V 

KF. A KKIK KSE GCK IPKV 

AAGTTTGCA?. GACATATCAA GAAATCXGAA GGCCAQAAAA TTCCTAAAGT 
A^GTTTGCAA GACATATCAA CAAATCTGAA CGCCAGAAAA TTCCTAAAGT 
AACTTTGCAA GACATATCAA CAAATCTGAA GGCCAGAAAA TTCCTAAAGT 
AACTTTGCAA GACATATCAA GAAATCTGAA GGCCAAAAAA AAAAAAAAG . 
.AGTTTGCAA GACATATCAA GAAATCTGAA GGCCAGAAAA TTCCTAAAGT 



ccs-« 



X 
L 



Q 



X 

I 



s 
s 



X 

I 



o 

r 



G 



V 
V 



z 



X 

I 



A 



D 

E 



I 
F 



K 



T 

est -'oo 

consensus GGACTTGCAA ATATCAATTT ATGGAGTAAA AATTCTAGAA CCCAAAACAA 

Secj C-GArrrTGCAA ATATCAATTT ATGGAGTAAA AATTCTAGAA CCCAAAACAA 

aCAlQ2 CGAGTTCCAA ATATCAATTT ATGGAGTAAA AATTCTAGAA CCCAAAACAA 

r7tf3Ti C-G.-.GTTGCAA ATATCAATTT . ATGGAGTAAA AATTCTAGAA CCCAAAACAA 



T, iK 7 VTF PLC *rsr CAD 

KEVC HKC CLK RISF CAC 
■?01 . 7SC 

eoAa«r.«us ACCAASTTCA ACACAATTCC CAGCTTCATA GAATATCTTT TTGTGCAGAT 
Saq ACG 

oGAiai AGG 

r7tf37i AGCAAGTTCA ACACAATTCC CAGCTTCATA GAATATCTTT TTGTGCAGAT 
dfi27ST CAATTCC CAGCTTCATAGCIP-ATATCTTOQGcraTCCACAT 



KC£0-tf 



EKC EKR« Far rAR«.2:cAS 
2KT CKRr rTF ICK CS££ 
7 5 i 800 

cease CIS US gataaaactc acaagagcat attcactttc atatgcaaag attctgagtc 

r753T- jATAAAAGTC ACAAGAGGAT attcactttc atatgcaaag ATT— wAGTC 
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CATAAAACTO 'aCAAGACGAT ATTCACTTTC ATATGCAAAG ATTCTGACTC 



CSO-fi C3CP SCV APTS JBXx. JLED 

hCSD-C KKH.LCy VFDS EKC AJEK 

901 850 

consenjcxis AiLa.T.^.a.aCAr TTGTGCTATC TATTTGACAG CGAAAAGTGT GCTGAAGAGA 

Seq CTGAAGACA 

aGAlQ2 i ,CTGAAGACA 

r7637a AAATAAACAT TTGTGCTATG TATTTGACAG CGAAAAGTGT GTAAGTATCC 

aa3079a2 GT GCTGAAGAGA 

da 2 76 7 AAATAAACAT TTGTGCTATG TATTTGACAG CGNAAAAGTGTGCTGAAGAGA 



cQuensus 

pCAlOl 
oGAlflS 

r7ff37a 

aa3 07932 
Cl527a7 



I T 
Z T 

TCArTTTAAC 



X 

I 



c 
G 



A F D 
A F D 



t. 
L 



A Y 



3C 
T 



K 



TCACTTTAAC 
TCACTTTAAC 
CACATCmrGT 
TCACTTTAAC 
TCACTTTAAC 



AArrGGCCAA GCATTTGA. . CCTGGCATAC 

AATTGCCCAA GCATTTGA CCTGGCATAC 
AATTGGCCAA GCATTTGA, , CCTGGCATAC 
AGGGCTTGCTT TGTTCTGTTT TAXAAG2TCC 
AATTGGCCAA GCATTTGA- - CCTGGCATAC 
AATTGGCCAA GCATTTGNN CTGGCATAC 



y 

F 
300 

AGGAAATTTC 
TC 

AGGAAATTTC 
AGGAAATTTC 
GGGGATTGTC 
AGGAAATTTC 
AGGAAATTTC 



hCHIJ'6 



CED-S 
KcSD-6 



oGAJ.01 
OGA102 

aa307992 
da27a7 



pGAlQl 

aCA107 
QGAXQ2 

r7^37a 

aa307933 
da2737 



5 



I. S 

sal 

TAGAA.TCAC 



2TRTS LEK QK QIVI 
GGKD VET QIAC 

5S0 

CACGAAiXAGA TGTTGAA AjCA AGAAAA C AGATCGGASG 

Pyjmejr eGA10 7-F 
Pr-imer 44S-X0934-Q4 

Priaier 445-1093^ -08-R 



T AGAA TCAG GAGGAAAACA TGTTGAAACA AGAAAA 
TAGAA TCAjG GAGGAAAAGA TGTTGAAACA AGAAAA 
TAGAA.TCAG GAGGAAAAGA TGTTGAAACA AGAAAA. 
TAGAA.TCAG GAGGAAAAGA TGTTGAAACA AGAAAA. . 
TTZGAA.TCAG GAGGAAAAGA TGTTGAAACA AGAAAA., 



C AGATCGCAGG 
C AGATCGCAGG 
-C AGATCGCAGG 
. C AGATCGCAGG 
.C AGATCGCAGG 



L K K 
Q K 

951 

OTTACAAAAA 
GTTACAAAAA 
GTTACAAAAA 

GTTJCCAAAil^ 

CTTG 

GTTACAJLAAA 
GTTACAAAAA 



I 
I 



V 
Q 



E 



L 
L 



E 
E 



T 
T 



E 
C 



if 



AGAATCCAAG 
AGAATCCAAG 
AGAATCCAAG 



ACTTAGAAAC AG.AAiATAT 
ACTTAGAAAC AG AAAATAT 
ACTTAGAAAC AG AAAATAT 



AGAATCCAAG ACTTAGAAA.C AGGAAAATAT 



V L I 
E I. S. 
1000 
GG AA-CTTAAA 
GGAACTTAAA 
GGAACTTAAA 
AAA 

GGAACTTAAA 



AGAATCCAAG ACTTAGAAAC AG. AAAATAT GGAACTTAAA 
AGACTCCANG ACTTAGAA?.C AC AAPATAT GGT 



csD-e 

tlCaD-6 



CEO-e 

hCSD-g 



pGAIQZ 
Seq 
QGA10 7 
OGA10 3 
aAD079a2 



E 



R 
V 



1C3Q1 

AATAA.2lGTAC 

aataaagtac 
aataaagtac 

AATAAAGTAC 
AATAAAGTAC 
AATAAAGTAC 



2 K 
A F 
105 1 



T 
P 



A 
D 



E 
£< 



A 
E 



R 
O 



A 
L 



R 



K 
T 



A 
C 



A, ACATTTGG 
A AGATTTGa 
A AGATTTGG 
A AGATTTGG 
A. ACATTTGC 
A . ACATTTGG 



AAAACCAACT 
AAAACCAACT 
A?AACCAACT 
AA?kACCAACT 
AAAACCAACT 
AAJIACCAACT 



CAGAATAACT 
GAGAATAACT 
GAGAATAACT 
GAGAATAACT 
CAKAA^TAACT 
GAGAATiACT 



G 
A 



P 
G 



P 
S 



X 
M 



T 



G 
K 



S 



F 



D y 
V S 
IQSO 
CAACTATCAG 
CAAGTATCAG 
CAACTATCAG 
CAAGTATCAG 
CAAGTATCAG 
CAAGTATCAG 

P S> A 

1100 
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pGAiai 

S«ct ■ 
aa307982 



CACCTCCACC 
CXCCTCCACC 
CATTCTCCAGC 
CACCTCCAGC 
CACCTCCAGC 
CACCTCCACC 



ACC-CA.. 
AGG CA 
AGG CA 
AGG CA 
AGGGCAA 
ACGCA. . 



.CT ATGACACCTA 
CT ATGACACCTA 
CT ATGACACCTA 
GT ATGACACCTA 
,CT ATGACACCTT 
,GT ATGACACCTA 



AG . . TCGCCC 
AG TCGCCC 
AG TCGCCC 
AG TCGCCC 
AAGTTCGCCC 
AG . . TCGCCC 



TCCACT.GAC 
TCCACT GAC 
TCCACT GAG 
TCCACT GAC 
TCCACTTGAC 
TCCACT. GAC 



CEb«6 
hCHD'-e 



IiSIiSPK P Q G 
I r D 

iioz 

ATCTTTGATA 
ATCTTTGATA 
ATCTTTGATA 
ATCTTTGATA 
ATCTTTGATA 
ATCTTTG, AT 



consensus 
pCAlOl 

oGAia7 
OGA1Q2 



P 
M 



I P 



I 
S 



P 

P 



P 

X 



s 
s 



s 



, TGATTCCAT 
TGATTCCAT 
TGATTCCAT 
TGATTCCAT 
ATCATTCCCT 
ATGATTCCAT 
CCAT 



TTTC . , TCCA 
TTTC TCCA 
TTTC TCCA 
TTTC TCCA 
TTTTCTTCCA 
TTTCT . . . CC 

iricr cc 



ATAT.CACJVC 
ATAT CACAC 
ATAT CACAC 
ATAT CACAC 
ATATTCACAC 
AATATCACAC 
AATATCACA 



I Y S 

g fi 5 

11 50 
C . AGTCTTC - 
C AGTCTTC 
C AGTCTTC 
C AGTCTTC 
CCAGTATTOr 
C - AGTCTTC - 
CCAGTCTTC- 



pGAlOl 

oGAi07 
oGAa02 
aa30T992 

aa431S9S 



M P 
1151 

GATCCCTAC , 



A M 

R N 



D L P 
G T C 



P 
P 



P 



P 



V 



, TCCCAAT . . GGCACACACC C . ACCTC-CA 



A P 
P 5 
1200 
GT2LCCTAGTA 



CATGCCTAC 
GATGCCTAC 
CATGCCTAC 
GATGCCTTCC 
GATGCCTAC . 
GATCCCTAC - 



TCCCAAT 
TCGCA3.T 
TCGCAAT 

TTCGCAATTG 

T-CGCAAT. . 

TCGCAAT 

, CAGCAAGTC 



CGCACACAGC 
CGCACACAGC 

GNACCACAGC 
CGCACACAGC 
CGCACACAGC 
AACATTTGAC 



primqg 44S -1093^-13 



C ACCTC CA 
C AjCCXC CA 
C ACCTC CA 
CCACCTTKCA 
C. ACCTC. CA 
C ACCTC CA 
ATATAGTTAT 



GTACCTACTA 

GTACCTAGTA 

GTACCTAGTA 

GTTCCTTAGT 

GTACCTAG . - 

GTACCTAGTA 

TTATTAGTTG 



CSD-fi 
hCED-6 



consensus 
pGAlCl 
Seq 
OGA10 7 
aGAlQ2 

aa431595 



T Xi P Q 

r, S T E 

1231 

CATCTACTGA 
GATCTACTGA 
GATCTACTGA 
CA27CTACTGA 

KAjrc 

GATCTACTGA 
ATC^AAGCAT 



I 
X 



K 



r 

R 



3 S 

0 L 



r 



G A 
G A 



S F 

£ .P 



GATTAAACGG CACCTGTTTG 
CATTAA^CGG GACCTCTTTG 
GATTAAACGG GACCTGrTTG 
GATTAAAJTGG CACCTGTTTG 



GAGCAGAACC 
GACGACAACC 
GAGCAGAACC 
GAGCAGAACC 



S V s 
F D P 

12S0 
TTTTGACCCA 
TTTTGACCCA 
TTTTGACCCA 
TTTTGACCCA 



GATTAAACGG CACCTGTTTG 
GAATATTTCA ACTTTAGTGT 



GAGCAGAACC TTTTGACCCA 
TCACTGATTT TATTTTGCTG 



hCSO-6 



PAS TSPS CPA 
r^fC GAAD FFP 
1251 

TTTA ACTGTG GAGCAGCAGA TTTCCCTCCA 
Pr-imt=r oGAlOg-F 
pGAiai TTTAACTGTG GACCAGCAaA TTTCCCTCCA 
Sc<r TTTAACTOTG GAGCAGCAGA TTTCCCTCCA 
<sCAlQ7 TTTAiACTGTG GAC-CACCAGA TTTCCCTCCA 
M 4-433^3 TTTA-\CTCTC GAGCAGCAGA TTTCCCTCCA 
aaa3l993 TAACATTT CATACTC TTTTTTA CA 



P S I P P P A 
□ X Q S K L D 
1300 

GATATTCAAT CAAAATTAGA 



GATATTCAAT 
GATATTCAAT 
GATATTCAAT 
GATATTCAAT 
GATATTAATT 



CAAAATTAGA 
CAWAATTAGA 
CAAAATTAGA 
CAAAATTAGA 
ATTTTATTCT 



CED-6 S T S 

hCEO-e £ M Q 

13C1 

consensus tgacatgcac 

pGAIOl TGACATGCAC 

Seq TGACATGCAC 

oGAl 0 T TGAGATGCAC 

oGAiaa G 



P S ' G PAPS 
Z O W K M G E. 

GACGGGTTCA AAATGGGACT 
GAGGGGTTCAAAAATGGGACT 
GAGGGGTTCA AAATGGGACT 
GACGGGTTCA AAATGGGACT 
GACCGGTTCA AAATGGGACT 



I P P P R P 
T D E G T V 

13S0 

AACTCTTCAA GGCACAGTAT 
AACTCTTGAAACCCACAGTAT 
AACTCTTCAA GGCACAGTAT 
AACTCTTGAA GGCACAAGTAT 
AACTCTTCAA GGCACAGTAT 
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1992 



rCASATGCAC CAGGGGTTCA AA^^TCGGACT AACTCLLC3AA GGCACACxTAT 
CTTTTA.CAjC GAGGGCTTCA AAATGGGACT AACTCTTGAA GGCACAGTAT 



csn-ff 

hCED-6 



CO OS ens US 

eGAlOS 
aa4433S3 



I. 

Ij 



V A 

F C 
1351 

TTT3TCrrCGA 

TTTGTCTCGA 
TTTGTCTCGA 
TTTGTCTCCA 
TTTGTCTCGA 
TTTGTCTCCA 



P P P 
P L D 

CCCGTTAOAC 

CCCGTTACAC 
CCCCTTAGAC 
CCCGTTAGAC 
CCCGTTAGAC 
CCCGTTAGAC 



P V 
3 R 



A ^, 
C 



AGTAC CTGCT GACATCAACA 

primer 445*10934 



1400 
ACAAGAAATC 



■11-g 



AGTACCTGCT GACATCAAGA 
AGTAGGTGCT GACATCAACA 
AGTAGGTGCT GACATCAAGA 
AGTAGGTGCX GACATCAAGA 
AGTAGGTGCT GACATCAAGA 



ACAAGAAATC 
ACAAQAAATC 
ACAAGAAATC 
ACAAGAAATC 
ACAAGAAATC 



CEO-€ JRJIOTWS PKNSTfi.GL!LO GLELGSAHPA KSCAPSJTirDa 

SroPRAGEKX STAASTOPrC ADFLSatQNG KSAPPSA2A3 LLASEAXA^U* PKPESSSVP^ 
CED-e KXTAAZYDAM ISIEVEKXrJVA MSSQSFEFGQ LQTGDLGGIS GMOVCTPSD Rr.2^^»iC^^OII4KQ 



c:ociscnsx;a 

Sccj 
oGAlO? 
oGAXOg 
oGAl 0 a 

aA431393 
r33 3a9 



14C1 

CTGATTCATG 



TTAAATGTGTTTGTATAC . A 



primer oGAlQ9-F 



CTGATTCATG 
CTGATTCATG 

CTGATTCATG 
CTGATTCZ.TG 
CTGATTCATG 
etCATTCATG 



TTAAATGTCT 
TTAAATGTGT 

TrriAATGTGT 
TTAAATGTCT 
TTAAATGTGT 
TTAAATGTCT 



TTGTATAC A 
TTGTATAC A 

TTGTATAC A 
TTGTATAC, A 
TTGTATAC. A 
TTGTATAC , A 



1450 

CATCTCATTT ATTATTATTA 



CATGTCATTT 
CATGTCATTT 
ACTGTTCATT 
CATGTCATTT 
CATGTCATTT 
CATGTCATTT 
CATGTCATTT 



ATTATTATTA 

ATTATTATTA 

ATTATTATT 

ATTATTATTA 

ATTATTATTA 

ATTATTATTA 

ATTATTATTA 



consensus 

OGA107 
OCA10 3 
aGAiOa 
aa4433 £9 
aa43199£ 

r6223£ 
1x03749 
r33335 



1451 

CTTTAAGATA 

CTTTAAGATA 

CTTTAAGATA 

CTTTA^RIAA 

CTTTAAGATA- 

CTTTAAGATA 

CTTTAAGATA 



GGTATTA.TT 
GGTATTA TT 
GGTATTA TT 
QGTTATTATT 
GGTATTA TT 
GGTATTA TT 
GGTATTA TT 



AAGATA GGTATTA.TT 

. . .TAACATA GGTATTA.TT 
CTTTAAGATA GGTATTA.TT 



XSQO 

CATGTGTCAA TGTTTTTGAA TATTTTAATA 
CATGTGTCAA TGTTTTTGAA T-XTTTTAATA 
CATGTGTCAATGGTTTTTTGAATATTTTAATA 
NTGCGOTCA GSTrTTTSXTJUS. TATTTTAATA 
CATGTGTCflA TGTTTTTGAA TATTTTAATA 
CATGTGTCAA TGTTTTTGAA TATTTTAATA 
CATGTGTCAA TGTTTTTGAA TATTTTAATA 

GATA 

CATGTGTCAA TGTTTTTGAA TATTTTAATA 
CATGTGTCAA TGTTTTTGAA TATTTTAATA 
CATGTGTCAA TGTTTTTGAA TATTTTAATA 



CO CIS ens us 

GGAIQS 
OGAlOa 
Aa443 3 €3 
aA43 1933 
rS233 Z. 
rS2235 
ho 3 74 9 
r33 3 35 



ISOl 

TTTTGAAAAT 
TTTTGAAAAT 
TTTtrrAAAAT 
TTTTGAAAAT 
TTTTGAJVAAT 
TTTTOAAA.^T 
TTTTGA-AAAT 
TTTTGAAA-\T 
TTTTGAAAAT 
TTTTG, 



TTTCTCAGTT 
TTTCTCAGTT 
TTTCTCAjMTT 
TTTCTCAGTT 
TTTCTCAGTT 
TTTCTCAGTT 
TTTCTCAGTT 
TTTCTCAGTT 
TTTCTCACTT 



AAATTTCCT. 

AAATTTCCT 

AAATTTCCT 

AAATTTCCT 

AAATTTCCT 

AAATTTCCT 

aaItttcct . 

AiiTTTCCT. 
AA;i.TTTCCT . 



CACCT . 
CACCT 
CACCT 
CACCT ••' 
CACCT 
CACCT 
CACCT. . 
CACCT. 
CACCT . . 
CACCT. . 



ISSO 

T CACTATTGAT 
T CACTATTGAT 
T CACTATT^rNAT 
T CACTATTGAT 
T CACTATTGAT 
T CACTATTGAT 
T CACTATTGAT 
T CACTATTGAT 
T CACTATTGAT 
T CACTATTGAT 



coasen-sus 



1S51 

CTCTAATTTT TATTTTAAAA ACAGCTTA CT G. 



ItfCQ 

TAAAGT AC A. . TCATA 



S^^ CrSTAATTTT TATTTTAAAA ACAGCTTACT G TAAAGT ACA TCATA 
COAiC? CGTTAATT^ TA Vl ' i L P^SKA Ac:rTCTTAC>f T T.^A-MTT TCATA 
oCAia3 CTGTAATTTT TATTTTAAAA ACAGCTTACT C TAAAGT AC-A TCATA 
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aa4 4 31£a 
rS38ai 



CTGTAATTTTT 
CTGTAA I LLi 
CTGTAATTTT 
CrCiAATTTT 
CTjTAATTTT 
CTCTAATTTT 



T A TT T T A AAA AjCAGCTTACT GT 
TATTTTAAAA 
TATTTTAAAA 
TATTTTAAAA 
TATTTTAAAA 

TATTTTAAAA ACAXICTTACT G. . .TAAAGT AGGA.TCATA 



ACACCTTACT C 
ACAGCTTACT G. 
ACAGCTTACT G. 
ACAGCTTACT G. 



TAAAGT AG A TCAXA 
.TAAAGT AG.A.TCATA 
.TAAAGT AG.A.TCATA 
TAAAGT AG.A.TCATA 



oCAlQS 

oGAioa 

aA4.3XdSS 

rS3fiai 

r£223£ 
r333e9 



1501 

crrrr. 



CTTTT 

CTTTT 

CTTTT 

CTTTT 

CTTTT- 

CTTTT- 

CTTTT, 

CTTTT . 



■ ATG TTCCTTTCTG TTTCTACTCT AGAT.,CAAT TTGTAATTGA 
ATG TTCCTTTCTG TTTCTACTCT AGAT GAAT TTGTAATTGA 
AKN TTCCTTTCCA TTTCTACGC TNKA CSTAA TTGHTAATTATA 
ATG TTCCTTTCTG TTTCTACTGT AGAT GAAT TTGTAATTGA 
ATG TTCCTTTCTG TTTCTACTGT AGAT GAAT TTGFTAATTGA 
.ATG TTCCTTTCTG TTTCTACTGT AGAT. , GAAT TTGTAATTGA 
.ATG TTCCTTTCTG TTTCTACTGT AGAT. '.GAAT TTGTAATTGA 
.ATG TTCCTTTCTG TTTCTACTCT AGAT. .GAAT TTGTAATTGA 
-ATG TTCCTTTCTG TTTCTACTGT AGGATGGAAT TTGTAATTGG 



1521 1700 
•consensus AAC.ACATAT TATACAA ATA CCTCCCTTCT GTCTGAG .TT CTATTTACTT 

grimer ^45-10934 «0g*? 
S<=q AAC- ACATAT TATACAAATA CCTGGCTTGT GTCTGAG TT CTATTTACTT 
CGA10 9 Airr ACATAT TATACAAATA CCGACCTTANGATCTSnJOTTT CTATTTATTT 
oGAiOa AAC ACATAT TATACAAATA CCTGCCTTGT GTCTGAG TT CTATTTAGTT 
aa4 319 9S AAC ACATAT TATACAAATA CCTGCCTTGT GTCTGAG TT C 

rsaaai aac.acatat tatacaaata cctgccttgt gtctgag.tt ctatttagtt 

rff22 3S A-lG. ACATAT TATACAAATA CCTGCCTTGT GTCTGAG.TT CTATTTAGTT 
ha 3 74 9 AAC.ACATAT TATACAAATA CCTGCCTTGT GTCTGAG.TT CTATTTAGTT 
r3 3 33 9 AAC-GACATAT TATACAAATA CCTGCCTTGT GTCTGAGGTT CTATTAGGTA 



cac3er.sus 
OGA10 9 

oOAzaa 

oGAilO 
r52331 
rfi223£ 
h037-ia 



AAATTTGTA" 



AC-r ATCTTG KAATTTGTAT 
trZZ ATCTGT AAATTGATAT 
AG- ATCTTG AAATTTGTAT 

AGC..C.TCTTG AAATTTGTAT 
AC-C- ATCTTG AAATTTGTAT 
AC^. ATCTTG AAXTTTCTAT 



1750 

TC AT T' lT CCA GATGGCTAG T TTATTAATGA 

rimer oGAiio-F 

TCATTTTCCACGATGGCTAGT TTATTAATGA 
TCATTTTCCA . TAGGSTCTCTTTTATTAAjGSIAT 
TCATTTTCCA GATGGCTAGT TTATTAATGA 

CTTTAATGA 
TCATTTTCCA GATGGCTAGT TTATTAATGA 
TCATTTTCCA OATGGCTAGT TTATTAATGA 
TCATTTTCCA GATGGCTAGT TTATTAATGA 



r33 3 39 GC-CCATCTGG AAATTTGTAT TCATT. 



17S1 

efonsea^us TTTCCCAAAA GCCATACCTT AAAG.ATAAC 

Secj TTTCCCAAAA GCCATACCTT AAAG ATAAC 

QGA10 5 TTCCCAAAN TCCATACCTT PJ^T ATAAC 

cGAaoa TTTCCCAAAA GCCATACCTT AAAG ATAAC 

CCAllS ■ TTTCCCAAAA GCCATACCTT A=J>iG AT,\AC 

r33 3ai TTTCCCAAAA GCCATACCTT AAAC.ATAXC 

r»a2 3S TTTCCCAAAA GCCATACCTT AAAC.ATAa.C 

ho 3 74^ TTTCCCAAAA GCCATACCTT AiACGA: 



laoo 

TTTTTAAATT CTGAAGA. .G 
pclcnez- 4-45 -10934-12 -R 
CTGAAGA G 
TNTAATA T 
CTGAAGA G 
CTGAAGA G 
CTGAACG . . G 
CTGGAAGA.G 
CTCGAAGGUG 



TTTTTAAATT 
TTTTTAAATT 
TTTTTAAATT 
TTTTT.=AATT 
TTTTTAAXTT 
TTTTTAiATT 



1^0 1 

ccnsensvis * ACATGCcaat C TCAAACTAA 

Seq: ACATGCCAAT GTCAAACTAA 

GGA1C3 ACArrrCCAA GTTCAAACTAA 

oCAlIO ACATGCCAAT GTCAAACTAA 

oQAioa ;;cATwcrAAT gtcaaactaa 

r53a8 1 ACATGCCAAT GTCAAACTAA 

r^223<5 ACATGCCAAT GTCAAACTAA 



ACATGTTCTG TTTTTAAA . C 
ACATGTTCTG TTTTTAAA C 
ACAiVT^fTTCGiy TTTTTAAJi. C 
ACATGTTCTG TTTTTAAA C 
ACATGTTCTG TTTTTAA*. C 



1950 
CAACAAACAT 
CAACAAACAT 
CAACAAACA^ 
CAACWiCAT 
CAACAAACAT 
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